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Concurrent  MR-NIR  Imaging  for 
Breast  Cancer  Diagnosis 

Birsen  Yazici 


I.  INTRODUCTION 

Near  infrared  (NIR)  diffuse  optical  imaging  provides  quantitative  functional  information  from  breast  tissue  that 
can  not  be  obtained  by  conventional  radiological  methods.  NIR  techniques  can  provide  in  vivo  measurements 
of  oxygenation  and  vascularization  state,  the  uptake  and  release  of  molecular  contrast  agents  and  chromophore 
concentrations  with  high  sensitivity.  There  is  considerable  evidence  that  tumor  growth  is  dependent  on  angio¬ 
genesis  [8]-  [10],  and  that  tumor  aggressiveness  can  be  assessed  from  its  increased  number  of  new  vessels  and 
reduced  oxygenation  state  relative  to  normal  breast  tissue  and  benign  breast  lesions  [11]-  [13].  NIR  diffuse  optical 
tomographic  (DOT)  methods  has  the  potential  to  characterize  angiogenesis  related  vessel  density  as  it  measures 
the  total  hemoglobin  concentration  and  provide  the  ability  to  differentiate  between  benign  and  malignant  lesions 
based  on  oxygen  saturation.  Furthermore,  NIR  methods  are  non-ionizing,  relatively  inexpensive  and  can  be  made 
portable. 

The  diagnosis  and  management  of  cancer  involves  several  stages  where  magnetic  resonance  (MR)  plays  a 
valuable  and  growing  role.  MRI  of  the  breast  is  now  a  routine  part  of  the  clinical  care  in  many  centers  [16]- 
[18].  Magnetic  Resonance  imaging  (MRI)  is  indicated  in  patients  with  inconclusive  clinical  and/or  mammographic 
examinations.  Patients  that  may  benefit  include  women  with  radiographically  dense  breasts,  and  high  risk  potential 
population  [19]-  [20].  MRI  possesses  less  than  10%  contrast  for  soft  tissue  pathology  [21].  Gadolinium  (Gd) 
enhanced  MRI  offers  much  better  contrast  and  is  specific  for  tumor  vessel  imaging.  However,  the  signal  in  the 
Gd-MRI  arises  from  the  larger  vessels  as  the  contrast  agent  is  flushed  out  of  the  vascular  bed  of  the  tumor  [22]. 
In  comparison,  NIR  measurements  of  absorption  have  extremely  high  contrast.  It  was  reported  that  5%  change 
in  vascular  density  as  measured  histologically  in  ductal  carcinomas  leads  to  approximately  300%  contrast  in  NIR 
absorption  coefficients  [14].  Furthermore,  there  are  studies  suggesting  that  the  kinetics  of  contrast  enhanced  optical 
spectroscopy  provides  information  about  the  cellular  spaces  [15].  On  the  other  hand,  NIR  DOT  suffers  from  poor 
spatial  resolution  and  as  such,  it  is  unlikely  that  NIR  imaging  will  be  a  stand-alone  screening  method  in  the  general 


5 


population.  Therefore,  we  believe  that  the  concurrent  MR  and  NIR  imaging  brings  together  the  most  advantageous 
aspects  of  the  two  imaging  modalities  (structural  and  functional).  In  the  future,  we  envision  that  this  multimodality 
imaging  approach  will  lead  to  high  resolution  hemoglobin  tomography  and  comprehensive  quantitative  functional 
tissue  characterization  to  differentiate  malignant  and  benign  tumors. 

In  this  project,  the  clinical  studies  are  performed  using  the  novel  MR-NIR  hybrid  time-resolved  spectroscopy 
(TRS)  imager  and  fast  Indocynine  Green  (ICG)  enhanced  spectroscopic  imager  developed  by  Dr.  Chance,  a  Co-PI 
of  this  proposal,  at  the  University  of  Pennsylvania  (UPenn),  Biophysics  Department,  Diffuse  Optical  Imaging  and 
Spectroscopy  Laboratory. 

The  central  hypothesis  of  this  project  is  that  the  concurrent  MR-NIR  diffuse  optical  tomographic  methods  coupled 
with  fast  contrast  enhanced  NIR  spectroscopic  methods  provide  fundamentally  new  quantitative  functional  and 
structural  information  for  breast  cancer  tumor  characterization  and  detection.  This  new  information  can  be  obtained 
by  novel  modeling,  analysis  and  data  fusion  methods  from  the  tomographic,  temporal  and  cellular-based  contrast 
measurements,  which  exploit  fast  imaging  techniques  together  with  TRS  tomographic  methods.  In  this  project, 
we  investigate  new  methods  for  multi-modality  high  spatial  resolution  hemoglobin  tomography,  pharmacokinetic 
modeling  of  molecular  contrast  agents  based  on  fast  NIR  spectroscopy  and  analysis  of  structural  and  functional 
information  provided  by  MR  and  NIR  imaging  methods  for  breast  cancer  detection  based  on  receiver  operating 
characteristics  methodology.  Specific  aims  of  the  project  are  as  follows: 

•  Aim  1 :  Utilize  a  priori  anatomical  information  provided  by  MRI,  to  reconstruct  3D  high  resolution  hemoglobin, 
water  and  lipid  concentration,  and  oxygen  saturation  images  directly  from  6  wavelength  time  resolved  optical 
measurements.  Evaluate  improvements  in  image  reconstruction  between  that  of  stand-alone  NIR  and  concurrent 
MR-NIR  measurements  using  water  and  lipid  images  obtained  from  MRI. 

•  Aim  2:  Develop  a  compartmentalized  pharmacokinetic  modeling  of  ICG,  optical  contrast  agent,  and  extract 
quantitative  parameters  that  can  characterize  tumor  metabolism  and  angiogenesis.  Compare  ICG  kinetics  with 
the  Gadolinium,  MR  contrast  agent,  kinetics  and  biopsy  findings. 

•  Aim  3:  Evaluate  accuracy  of  breast  cancer  diagnosis  based  on  the  quantitative  functional  information  extracted 
from  stand-alone  NIR  system.  This  information  includes  hemoglobin,  water  and  lipid  concentration,  optical 
scatter  power  and  oxygen  saturation  images,  and  ICG  pharmacokinetic  parameters.  Evaluate  the  added  value 
of  ICG  kinetic  parameters  in  breast  cancer  diagnosis. 

•  Aim  4:  Combine  NIR  based  breast  cancer  diagnosis  features  with  the  systematic  MR  breast  architecture  and 
kinetics  interpretation  model  developed  by  Dr.  Nunes,  M.D,  Co-PI  of  this  proposal,  to  evaluate  the  sensitivity 
and  specificity  of  concurrent  MR-NIR  imaging  method.  Compare  results  with  that  of  stand-alone  MR  and  NIR 
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results. 

In  the  following  sections,  we  will  provide  detailed  description  of  our  current  research  in  line  with  the  statement 
of  work  (SOW)  and  the  aims  outlined  above.  For  the  period  of  June  1st,  2005  to  May  31st  2006,  SOW  includes 
only  the  first  two  aims  of  the  project. 


II.  BODY 

In  the  year  2  of  the  project,  SOW  includes  tasks  related  to  Aim  2  and  3  of  the  project.  Below,  we  describe  the 
works  that  has  been  performed  towards  the  Aim  2,  Task  3  and  Aim  3  Tasks  1  and  2.  We  also  improved  upon  the 
Task  1  of  Aim  1. 


A.  AIM  2  -  Tasks  3 

•  Task  3.  Extract  kinetic  parameters  from  invivo  NIR  measurements.  9-18th  month 

1 )  Pharmacokinetic  Modeling  and  Compartmental  Model  Analysis:  In  general,  the  continuous  time  state-space 
representation  for  an  n-compartment  model  is  given  by 

C{t)  =  K{aMt)  (1) 

m{t)  =  \  {(Xn)C{t)  (2) 


where  C(f)  denotes  the  concentration  vector  whose  elements  are  the  concentrations  of  the  fluorescence  agents  in 
different  compartments,  and  C{t)  denotes  its  time  derivative.  m{t)  is  the  bulk  fluorescence  concentration,  n  is  the 
number  of  compartments,  q:„  is  the  parameter  vector  whose  elements  are  the  pharmacokinetic  rate  constants  and 
volume  fractions.  K(Q:n)  is  the  system  matrix  with  entries  being  the  pharmacokinetic  rates,  and  \{an)  is  the  vector 
containing  the  volume  fractions  [3]. 

Although  the  formulation  for  the  direct  reconstruction  of  pharmacokinetic  rate  images  can  be  applicable  for 
n-compartment  models,  here  the  two-compartment  model  for  ICG  pharmacokinetics  will  be  our  running  example 
[5],  [6].  Based  on  a  two-compartment  model  for  ICG  kinetics,  as  shown  in  Figure  1,  the  ICG  transition  between 
the  two  compartments  can  be  modeled  by  two  coupled  differential  equations 

=  -koutCeit)  +  kinCpit),  (3) 


dCp{t) 


dt 


{kin  T  kglnn^Cp{P)  -\~  koutCe{l)  ■ 


(4) 
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Here,  C{t)  in  (1)  and  (2)  includes,  Cp{t)  and  Ce{t),  representing  the  ICG  concentrations  in  the  plasma  and 
extracellular  extravascular  space  (EES),  respectively.  The  parameter  vector,  a2,  includes,  kin,  kout,  keim,  Vp,  and 
Ve,\ 

Ot2  —  \kin  kout  kgim  '^p  ^e]  (5) 


where  kin  and  kout  are  the  pharmacokinetic  rates  that  govern  the  leakage  into  and  the  drainage  out  of  the  EES, 
keim  descrihes  the  ICG  elimination  from  the  body  through  kidneys  and  livers,  and  Vp  and  Ve  are  the  plasma  and 
EES  volume  fractions,  respectively. 


Capillary 

keim 

kin 

j 

kout 

EES 

Co 

Fig.  1.  Block  diagram  of  the  two-compartment  model  for  ICG  pharmacokinetics. 


Here,  to  obtain  2-D  images  of  pharmacokinetic  rates  and  fluorescence  agent  concentrations  in  different  compart¬ 
ments,  we  extend  the  compartmental  model  equation,  (1)  and  (2)  to  spatially  resolved  model  as  follows: 

C(ri,f)  K(Q;n(ri))  0  0  C(ri,f) 

:  =  0  0  :  +^(r,i)  (6) 

C{rN,t)  0  0  K(Q;„(rAr))  C{rN,t) 

where  cj(r,  f)  is  uncorrelated  zero  mean  Gaussian  processes  with  covariance  matrix  Q,  representing  the  small 
deviations  resulting  from  model  mismatch,  r  =  [ri  r2....rAr],  and  rj  is  the  location  of  the  voxel  for  j  = 
l,2,3...iV,  N  being  the  number  of  total  voxels  in  the  discretized  domain. 


m{ri,t) 


m{rN,t) 


V(Q;„(ri))  0  0 

0  ■•.  0 

0  0  \{an{rN)) 


C(ri,f) 


C{rN,t) 


+  r/(r,f) 


(7) 


where  r]{r,  t)  is  a  zero  mean  Gaussian  process  with  covariance  matrix  R  presenting  the  noise  in  the  measurements. 


The  implicit  form  of  (6)  and  (7)  can  be  expressed  as 


C(r,f)  =  K(Q:„(r))C(r,f)  +  cu(r,f)  (8) 

m(r,  t)  =  \{oLn{r))C{r,  t)  +  r]{r,  t).  (9) 

The  quantity  we  wish  to  reconstruct  is  the  spatially  varying  pharmacokinetic  rate  parameters  instead  of  the  absorption 
coefficient  of  the  florescence  agent  as  is  commonly  the  case  in  fluorescence  diffuse  optical  tomography  (FDOT) 
[1].  To  do  this,  we  first  need  to  develop  a  mapping  which  relates  the  boundary  flux  measurements  to  the  bulk 
florescence  agent  concentrations  in  tissue.  This  mapping  can  be  obtained  by  using  FDOT  forward  model  which 
will  be  explained  in  the  next  section. 

2)  Fluorescence  Diffuse  Optical  Tomography:  We  used  a  coupled  system  of  diffusion  equations  to  model 
fluorescence  light  propagation  in  tissue  [1].  Based  on  the  coupled  diffusion  equations,  the  forward  model  for 
FDOT  can  be  expressed  as: 

= /(Ma/(r,f)),  (10) 

where  t)  =  [^a/(ri,  f),  •  •  • ,  Paf{rN,  f)]  is  the  time  dependent  absorption  coefficient  of  the  fluorescence  agent, 

^'(r,  t)  =  f),  •  •  • ,  'I'(rAr,  f)]  is  the  time-dependent  boundary  flux  measurements,  and  /  is  a  nonlinear  function 

defined  by  the  coupled  diffusion  equation.  Under  the  assumption  that  the  exogenous  optical  properties  has  no  effect 
on  endogenous  optical  properties,  (10)  can  be  linearized  to  obtain 

(11) 

where  W  is  the  weight  matrix  relating  the  absorption  coefficients  to  the  boundary  measurements.  This  map  can  be 
numerically  calculated  using  finite  elements  methods  (FEM)  of  finite  different  methods  (FDM)  [4]. 

The  absorption  coefficient  of  the  fluorescence  agents  is  related  to  the  bulk  fluorescence  agent  concentration  as 
follows: 

=  em(r,f)  (12) 

where  e  is  the  extinction  coefficient  of  the  fluorescence  agent  at  the  excitation  wavelength. 

3 )  Pharmacokinetic  Rate  to  Measurement  Map  for  Fluorescence  Agents:  To  derive  a  complete  formulation  to 
reconstruct  the  pharmacokinetic  parameters  and  concentrations  in  different  compartments  using  the  photon  flux 
measurements,  we  combined  the  FDOT  forward  problem  equations,  (10)  or  (11)  with  the  compartmental  model 
equations. 
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Combining  (9)  and  (10),  with  the  knowledge  of  the  linear  relationship  between  the  absorption  coefficient  and  the 
bulk  fluorescence  concentration,  (12),  nonlinear  FDOT  forward  problem  is  related  to  the  bulk  fluorescence  agent 
concentration  as: 

^(r,  t)  =  g{m{r,  t))  ~  5r(V(Q:n(r))C(r,  f))  (13) 

where  g  is  the  non-linear  function  which  relates  the  bulk  fluorescence  agent  concentrations  to  the  boundary  flux 
measurements. 

The  linearized  version  of  (11)  is  given  as: 

^'(r,  f)  =  eWm(r,  t) 

=  eWV(«„(r))C(r,  t)  +  eWr7(r,  t) 

=  TC{r,t)  +  eWr]{r,t),  (14) 

where  W,  V  and  r]{r,t)  are  defined  above  and  T  =  eWV(Q;n(r)). 

The  equations  (8)  and  (13),  which  combine  the  FDOT  forward  problem  with  the  compartment  model  equations 
constitute  the  set  of  equations  which  will  be  used  for  the  direct  reconstruction  of  pharmacokinetic  rates,  volume 
fractions,  and  fluorescence  concentrations  in  different  compartments. 

4 )  Discretization  of  the  Compartmental  Model  Equations  and  a  Dynamic  Model  for  System  Parameters:  The 
source  detector  measurements  in  (9)  are  collected  at  discrete  time  instances,  t  =  kT,  k  =  0, 1, ...,  where  T  is  the 
sampling  period.  Therefore,  the  continuous  model  described  in  (8)  and  (14)  has  to  be  discretized.  To  simplify  our 
notation,  we  shall  use  C(r,  k)  =  C(r,  kT)  and  '4'(r,  k)  =  '4'(r,  kT). 

Let  On  denote  the  discrete-time  parameter  vector  of  the  pharmacokinetic  rates  and  volume  fractions.  The  parameter 
vector  0„(r)  can  be  either  time  dependent  or  time  independent.  The  formulation  given  in  this  work  can  be  used 
for  both  cases.  In  our  case,  the  pharmacokinetic  rates  and  the  volume  fractions  are  time  independent.  However,  in 
order  to  estimate  On  within  the  EKF  framework,  the  following  dynamic  model  is  introduced: 

0n{r,k  +  l)  =  0n{r,k)  +  (;{r,k),  (15) 

where  <j(r,  k)  is  a  zero  mean  white  noise  process  with  covariance  matrix  S  .  The  details  of  the  dynamic  model 
introduced  for  the  joint  estimation  of  the  system  parameters  and  the  states  can  be  found  in  [5]. 

5)  A  priori  information  for  Pharmacokinetic  Rates  and  Volume  Fractions:  To  improve  the  robustness  of  estimates 
of  the  parameters  we  impose  a  priori  information  on  pharmacokinetic  parameters  and  volume  fractions.  Here,  we 
assume  that  there  is  no  information  about  the  tumor  structure  and  used  a  4-pixel  neighborhood  model. 
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Using  the  4-pixel  neighbor  model  with  equal  weights  (5,  the  random  process,  Onir,  k),  can  be  modeled  as: 

4 

On{rj,k  +  1)  =  I38n{rj,k)  +  p'^0n{rji,k)  +  <i{r,k).  (16) 

i=l 

6)  Extended  Kalman  Filter  (EKE)  Formulation:  In  our  state-space  model,  (8)  and  (9),  both  the  states  (con¬ 
centrations)  and  system  parameters  (pharmacokinetic  rates  and  volume  fractions)  are  unknown.  In  this  case,  the 
state-space  model  can  be  regarded  as  a  non-linear  model  in  which  system  parameters  and  states  are  combined  to 
form  the  new  states  of  the  non-linear  model.  This  system  is  then  linearized  and  solved  for  the  unknown  states  using 
EKF  framework  [5].  To  solve  for  the  concentrations  and  the  unknown  parameters,  the  parameter  vector  0„(r,  k)  is 
inserted  into  the  concentration  vector  C(r,  /c)  as: 

C(r, /c  +  1)  _  K(0„(r,  A:))C(r, /c)  ^  ^{r,k) 

6n{rj,k  +  l)  (i8n{rj,k)  +  l3Y!l^^en{rji,k)  <j(r,/c) 

The  measurement  equation  for  the  non-linear  case  is  given  as: 

r  1 

^'(r,/c)=  5(V(a„(r))C(r,A:))  0  ’  +r/(r,fe).  (18) 

[  0„(r,A:) 

Without  explicit  proof,  the  extended  Kalman  filtering  algorithm  for  simultaneous  estimation  of  concentrations 


and  parameters  for  the  nonlinear  case,  (13),  is  given  follows: 

C(r,0)  1  ^  r  i?(C(r,0)) 

0„(r,O)  0n(r,O) 

r  Uar(C(r,0))  0 

Po,o  = 

0  Sd 


(19) 

(20) 


where  E  denotes  the  expected  value  of  C(r,  0),  P  is  the  error  covariance  matrix,  and  is  the  preassigned  covariance 


matrix  of  the  unknown  system  parameters. 


The  following  equations  describe  how  the  concentration  estimates  and  error  covariance  matrix  are  updated  at  the 
k^^  time  instant  given  all  the  measurements  up  to  {k  —  1)*^  time  instant.  For  A:  =  1,  2, ..., 

C(r,A:|A:-l)  K(0„(r,  A:  -  l))C(r,  A:  -  1) 

en{rj,k\k  -  1)  f36n{rj,k-  1)  + /3  ZlLi  A:) 

Pfc|fc_i  =  J(A:  -  l)Pfc_i,fc_iJ^(A:  -  1)  +  °  ,  (22) 

0  Srf 
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where  J  is  the  Jacohian  matrix  given  hy 

K(0n(r,fe))  ^[K(0„(r,A:))C(r,i 

^0n(r, /c) 

Gfc  =  Pfc,fc-iA^[APfc^fe_iA^  +  R]“^, 


J(A:)  = 


where  Gk  is  the  recursive  Kalman  gain,  R  is  the  covariance  matrix  of  the  measurements  and  A  is: 

T 


A  = 


^g{\{en{r,k\k-l)C{r,k\k-l)) 


_d_  _ 

dO  \oC 


y{\{0n{r,k\k  -  l)C{r,k\k  -  l) 

Pfc.fc  =  [I  —  GfeA]Pfc^fc_i. 


where  I  is  the  identity  matrix. 

C(r,fc) 


C{r,k\k-1) 

pen{rj,k\k  -  1)  +/3I]Li  On{rji,k\k  -  1) 


+Gfc(^'(r,  k)  -  ff(V(0„(r,  k\k  -  l)C(r,  k\k  -  1)). 


(23) 

(24) 


(25) 

(26) 


B.  Clinical  Results 

1 )  Apparatus:  We  used  the  data  collected  with  a  continuous  wave  (CW)  NIR  imaging  apparatus.  The  apparatus 
has  16  light  sources,  which  are  tungsten  hulhs  with  less  than  1  watt  of  output  energy.  They  are  located  on  a  circular 
holder  at  an  equal  distance  from  each  other  with  22.5  degree  apart.  Sixteen  detectors,  namely,  silicon  photodiodes, 
are  situated  in  the  same  plane.  The  hreast  is  arranged  in  a  pendular  geometry  with  the  source-detector  prohes  gently 
touching  its  surface.  Figure  2  illustrates  the  configuration  of  the  apparatus  and  the  configuration  of  the  detectors 
and  the  sources  in  a  circular  plane.  A  hand  pass  filter  at  805nm,  the  absorption  peak  of  ICG,  is  placed  in  front  of 
the  sources  to  select  the  desired  wavelength.  A  set  of  data  for  one  source  is  collected  every  500  ms.  The  total  time 
for  a  whole  scan  of  the  hreast  including  16  sources  and  16  detectors  is  8.8  seconds.  The  detectors  use  the  same 
positions  as  the  sources  to  collect  the  light  originating  from  one  source  at  a  time.  Only  the  signals  from  the  farthest 
11  detectors  are  used  in  the  analysis.  For  example,  when  Source  1  is  on,  the  data  is  collected  using  detectors  4  to 
14.  A  more  detailed  explanation  of  the  apparatus  and  the  data  collection  procedure  can  he  found  in  [7]. 

2)  Protocol:  Patients  with  suspicious  hreast  tumors  were  enrolled  for  this  study.  ICG  was  injected  intravenously 
hy  holus  with  a  concentration  of  0.25  mg  per  kg  of  body  weight.  Diagnostic  information  is  obtained  using  biopsy 
results.  Since  biopsy  modifies  the  blood  volume  and  blood  flow  around  the  tumor  region,  measurements  were  made 
before  the  biopsy.  Data  acquisition  started  before  the  injection  of  ICG  and  continued  for  10  minutes. 
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Fig.  2.  The  cut  section  of  the  16  light  source-detector  device,  holding  a  human  breast  inside.  The  diameter  can  be  fitted  easily.  The  16 
light  source-detector  combinations  in  each  arm  are  located  equal  distance,  but  when  the  device  fits  the  breast,  only  the  diameter  chances. 


t  =44sec  in  t  =88sec  4u  I  =132sec 


-4  -2  024  -4  -2  024  -4  -2  024 

X  [cm]  X  [cm]  X  [cm] 


Fig.  3.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  1. 

3)  Tumor  Information:  Three  different  patients  with  different  tumor  types  are  included  in  this  study.  First  case, 
Case  1,  is  fibroadenoma,  which  corresponds  to  a  mass  estimated  to  he  1—2  cm  in  diameter  within  a  hreast  of  9  cm 
diameter.  Second  case.  Case  2,  is  adenocarcinoma  corresponding  to  a  tumor  estimated  to  he  2—3  cm  in  diameter 
within  a  hreast  of  7.7  cm  diameter.  The  third  case.  Case  3,  is  invasive  ductal  carcinoma,  which  corresponds  to  a 
mass  estimated  to  he  3—4  cm  in  diameter. 

4)  ICG  Concentration  Measurements  for  Pharmacokinetic  Parameter  Estimations:  Using  the  CW  imager  de- 
scrihed  above,  for  each  patient,  sufficient  number  of  source  detector  readings  were  collected  from  different  angles. 
Reconstruction  of  differential  absorption  images  using  the  source-detector  readings  was  discussed  in  Intes  et  al.  [2]. 
A  sample  set  of  reconstructed  differential  absorption  images  for  Case  1,  Case  2,  and  Case  3  for  9  selected  time 
instants  are  shown  in  Figure  3,  4,  and  5,  respectively. 

Using  the  linear  relationship  between  ICG  concentration  and  absorption  coefficient,  we  obtained  ICG  concentra¬ 
tion  images  from  differential  absorption  images  for  each  case.  A  sample  set  of  ICG  concentration  images  for  the 
selected  time  instants  is  shown  in  Figure  6,  7,  and  8  for  Cases  1,  2  and  3,  respectively.  Here,  the  concentration 
images  represent  bulk  ICG  concentration  in  the  tissue,  not  specifically  in  plasma  or  the  EES. 


13 


Fig.  4.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  2. 
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Fig.  5.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  3. 
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Fig.  6.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  1 
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Fig.  7.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  2. 


-2  0  2  -2  0  2  -2  0  2 
X  [cm]  X  [cm]  X  [cm] 


Fig.  8.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  3. 


Using  the  ICG  concentration  curves,  we  estimated  the  pharmacokinetic  parameters  for  each  pixel  based  on  the 
two-compartment  model.  We  then  constructed  2-D  permeability  rate  images  using  values  of  these  parameters.  2-D 
images  for  kin,  kout  for  two-compartmental  model  for  each  case  are  shown  in  Figures  9  (a),  (b),  10  (a),  (b),  and 
1 1  (a),  (b),  respectively.  We  also  analyzed  whether  the  kinetic  rates  are  statistically  different  or  not  for  the  inside 
and  outside  the  tumor  region.  The  kin  and  kout  values  from  inside  and  outside  the  tumor  region  are  statistically 
different  with  a  p-value  of  less  than  0.0001  for  all  cases. 

We  constructed  2-D  ICG  concentration  images  for  plasma  and  the  EES.  Eigures  12-17  show  the  ICG  concentration 
in  the  EES  and  plasma  for  3  different  time  instants  for  Case  1,  2,  3,  respectively.  We  observed  that  ICG  concentrations 
in  plasma  and  the  EES  compartments  are  higher  around  the  tumors  agreeing  with  the  hypothesis  that  around  tumor 
region  ICG  may  act  as  a  diffusible  extravascular  flow  in  leaky  capillary  of  cancer  vessels. 

C.  AIM  3  -  Tasks  1  and  2 

The  SOW  with  regard  to  Aim  3  includes  the  following  specific  tasks: 
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(a)  (b) 

Fig.  9.  2-D  images  for  pharmacokinetic  rates  kin  and  kout  for  Case  1. 


■I  ■ 


(a)  (b) 

Fig.  10.  2-D  images  for  pharmacokinetic  rates  ki„  and  kout  for  Case  2. 


(a)  (b) 

Fig.  11.  2-D  images  for  pharmacokinetic  rates  ki„  and  kout  for  Case  3. 
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(a) 


(b) 


(c) 


Fig.  12.  2-D  ICG  concentration  images  in  plasma  for  Case  1  for  the  246.4*^,  334.4*^,  and  422.4*'*  seconds. 
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(a)  (b) 

Fig.  13.  2-D  ICG  concentration  images  in  the  EES  for  Case  1  for  the  246.4*^,  334. 4‘^ 


(a) 


(b) 


Fig.  14.  2-D  ICG  concentration  images  in  plasma  for  Case  2  for  the  228.8*^,  316.8*^, 
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(a) 


(b) 


Fig.  15.  2-D  ICG  concentration  images  in  the  EES  for  Case  2  for  the  228.8*^,  316.8*^ 
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Fig.  16.  2-D  ICG  concentration  images  in  plasma  for  Case  3  for  the  246.4*^,  378.4*^,  and  510.4*^  seconds. 


17 


(a)  (b)  (c) 

Fig.  17.  2-D  ICG  concentration  images  in  the  EES  for  Case  3  for  the  246.4*^,  378.4*^,  and  510.4*^  seconds. 


•  Task  1.  Determine  statistical  variability  of  each  NIR  feature  inside  and  outside  the  suspected  tumor  in  an 
individual  and  evaluate  the  statistical  significance  of  the  measured  difference  with  the  instrumentation  precision. 
12-18th  month 

•  Task  2.  Design  statistical  classifiers  fo  determine  the  ROC  of  each  NIR  feature  for  an  individual.  18-24th 
month 

•  Task  3.  Evaluate  the  ROC,  positive  predicted  value  (PPV)  and  negative  predicted  value  (NPV)  of  various 
combinations  of  the  NIR  features  for  an  individual.  24-27 th  month 

•  Task  4.  Investigate  the  significance  of  the  measured  difference  between  malignant  and  benign  tumor  patient 
groups  for  single  and  combined  NIR  features.  27-30th  month 

Year  2  tasks  involve  Task  1  and  2. 

•  Task  1  and  2.  The  evaluation  of  ICG  related  parameter  is  partly  addresses  in  Task  3  of  Aim  2. 

This  work  describes  the  characterization  efficiency  of  incremental  biochemical  and  physiological  properties  of 
breast  tumors  [23].  In  this  work  an  NIR  method  which  is  capable  of  rapidly  acquiring  data  from  the  human 
breast  with  a  handheld  spectroscopy  apparatus  is  used.  The  device  measures  relative  increases  of  tumor  hemoglobin 
concentration  and  relative  hemoglobin  desaturation,  all  data  being  taken  on  a  relative  basis  using  generally  the  mirror 
image  site  on  the  contralateral  breast,  substantially  mitigating  the  multiple  effects  of  variable  demographic  and 
structural  features  of  the  human  breast  by  measuring  relative  hemoglobin  concentration  and  oxygenation  compared 
to  a  normal  breast  within  subjects.  In  analyzing  relative  hemoglobin  concentration  against  relative  saturation,  a 
nomogram  display  conveniently  segregates  verified  cancers  from  cancer-free  breasts  over  a  wide  range  of  tumor 
sizes  and  types. 

1)  Apparatus:  In  this  study,  a  continuous  wave  (CW)  near  infra  red  spectrometer  (NIRS)  is  used  (Fig.  I)  [23].  In 
the  center  of  the  probe  there  is  a  3-wavelength  LED.  The  LED  intensity  is  low,  10-15  mA.  The  probe  consisted  of 
one  multi-wavelength  LED  as  a  light  source  and  8  silicon  diodes  as  detectors  (Fig.  2).  These  8  detectors  surround 
the  LED  at  4  cm  distance,  so  that  8  locations  over  a  9  cm  diameter  area  from  a  breast  can  be  measured.  The 
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Fig.  18.  A  photograph  of  the  whole  apparatus  (Fig.  lA)  illustrates  the  handheld  puck  or  probe,  the  coupling  to  the  circuit  box  which 
contains  the  drivers  for  the  LED,  the  amplifiers  for  the  OPTI-lOls,  the  digitally  controlled  gain  adjustment  amplifier,  the  electronic  switch 
which  decodes  the  light  pulses  and  stores  the  information  in  a  memory  capacitor,  the  second  set  of  switches  which  sample  the  memory 
capacitor  at  a  rate  compatible  with  the  computer  ADC,  the  software  for  computing  blood  concentration  and  blood  saturation  and  the  display 
on  the  computer  which  serves  to  normalize  the  signals  through  the  digitally  controlled  amplifier. 

light  sources  are  flashed  alternately  100  times  per  second  and  electronic  circuits  are  designed  to  amplify  and  time 
separate  the  signals  in  a  sample  and  hold  circuit  which  integrates  the  signals  over  an  interval  of  a  few  seconds 
sufficiently  rapidly  to  follow  the  movements  of  the  sensor  over  the  breast,  allowing  usually  10  seconds  for  any 
particular  sensor  position.  The  data  are  then  digitized  and  presented  as  a  running  time  display  so  that  the  operator 
can  be  sure  that  stable  readings  are  reached  at  each  position  of  the  sensor.  The  light  intensity  from  the  8  detectors 
were  adjusted  to  be  near  1  volt  by  gains  set  and  calibrated  with  a  phantom  with  known  absorption  and  scattering 
coefficients  (/Tq  =  0.04  to  0.07  and  ji'g  -  8  cm~^).  The  puck  (Fig.  2)  is  then  transferred  to  the  contralateral  breast  to 
include  the  mirror  image  location  of  the  suspected  cancer.  The  signal  outputs  from  all  source  detector  combinations 
are  recorded.  The  probe  is  then  moved  from  the  contralateral  breast  to  the  ipsilateral  breast  suspected  of  cancer. 
The  sensors  giving  the  largest  changes  with  respect  to  the  mirror  image  position  on  the  contralateral  breast  are 
related  to  the  suspected  cancer.  The  procedure  requires  less  than  10  minutes. 

2)  Subjects:  This  study  includes  two  Clinical  Centers:  the  Abramson  Family  Cancer  Research  Institute/Department 
of  Radiology  of  the  Hospital  of  University  of  Pennsylvania  (HUP),  and  the  Department  of  Gynecology  of  Leipzig 
University.  The  population  targeted  at  HUP  are  those  who  have  come  for  possible  biopsy  and  for  radiology.  The 
second  group  has  come  to  the  Breast  Cancer  Clinic  at  Leipzig.  HUP  and  Leipzig  provided  24  and  20  cancer  patients, 
respectively,  and  64  and  8  non  cancer  disease  patients,  respectively.  Skill  has  been  required  in  order  to  ensure  all 
sensors  were  pressed  upon  the  breast  with  equal  pressure  (~  3  mmHg).  Patients  were  lying  on  their  backs  and  the 
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Fig.  19.  The  near  infrared  probe  which  8  sources  and  8  detectors. 

puck  was  placed  on  the  breast  in  such  a  way  that  coronal  breast  scan  data  is  acquired  (perpendicular  to  gravity). 

3)  Feature  Extraction:  The  features  that  is  used  for  tumor  characterization  may  be  computed  using  Ae  lcm“^ 
and  AL  =  4cm  x  5  for  pathlength  factors  of  5,  AOD  =  Ae.AC.AL  where  OD  is  the  optical  density  ,  £  is  the 
extinction  coefficient,  C  is  blood  concentration,  and  L  is  the  mean  pathlength  of  photons. 

Then  relative  blood  concentration,  ABV  and  oxygen  saturation,  ADeoxy,  can  be  approximated  at  two  different 
wavelengths  by 

ABM  oc  0.3AOi473Q  +  AOD^^q  (27) 

ADeoxy  oc  IAAOD^^q  —  AOD^^q  (28) 

The  concentrations  of  Hb  (hemoglobin),  Hb02  (oxy  hemoglobin)  were  calculated  by  a  modified  Beer-Lamberf 
Law  where  I  is  lighf  intensity  after  absorption  and  scattering  and  and  using  known  extinction  coefficients  of  Fbb, 
Hb02  and  differential  pathlength  factors  [24]. 

ABV  and  ADeoxy  can  be  calculated  by  using 

ABV  oc  A[Hb]  +  A[Hb02]  (29) 

ADeoxy  oc  A[Hb02]  -  A[Hb]  (30) 

Here  it  is  important  to  note  that  ABV  and  ADeoxy  were  based  on  a  lipid  blood  oxygen  model.  Thus  the 
increments  of  BV  and  Deoxy  are  relative  to  the  contralateral  breast: 

ABV  =  ABVtumor  -  ^BVcontra  (31) 

ADeoxy  =  ADeoxytumor  -  ^Deoxycontra  (32) 

where  ABVtumor,  ^BVcontra  hTC  ABV  in  the  tumor  breast  and  the  mirror  image  position  of  the  contralateral 
breast,  respectively,  and  ADeoxytumor,  ^Deoxycontra  are  ADeoxy  in  the  tumor  breast  and  the  mirror  image 
position  of  the  contralateral  breast,  respectively. 
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4 )  Feature  Analysis  and  Classification  for  Cancer  Characterization:  In  this  subsection,  we  present  the  set  of 
tumor  classification  features  and  the  malignancy  differentiation  criteria.  FI  denotes  ABV,  F2  denotes  ADeoxy, 
and  F3  denotes,  S,  size  of  the  tumor.  We  evaluate  the  malignancy  differentiation  capability  of  the  individual  features 
and  various  combinations  of  the  these  features. 

As  a  malignancy  differentiation  criterion,  we  used  the  Neyman- Pearson  hypothesis  testing  method  [25].  Neyman- 
Pearson  decision  rule  maximizes  true  positive  rate  subject  to  a  given  false  positive  rate.  True  positive  rate  is  defined 
as  the  ratio  between  the  number  of  correct  malignant  decisions  and  total  number  of  malignant  cases.  False  positive 
rate  is  the  ratio  between  the  number  of  incorrect  malignant  decisions  and  total  number  of  benign  cases. 

Let  F  be  a  set  of  features  chosen  from  FI  -  F3.  The  Neyman-Pearson  statistics  is  given  by  the  following  likelihood 
ratio: 


P{F  \Hi 


=  m 


(33) 


P(F  \Ho) 

where  P{F  \Ho)  is  the  conditional  probability  of  F  given  that  the  tumor  is  benign,  and  P{F  \Hi)  is  the  conditional 
probability  of  F  given  that  the  tumor  is  malignant.  In  Neyman-Pearson  likelihood  ratio  test;  a  threshold  value  Tq, 
is  chosen  so  that  the  false  positive  probability  is  equal  to  a,  i.e.. 


a  =  P{l(F)  >  Ta\Ho) 


(34) 


This  results  in  the  following  malignancy  differentiation  criteria  : 

Flo  ■  HP)  <  fa  Benign  tumor 

Hi  :  1(F)  >  Ta  Malignant  tumor 

Neyman-Pearson  test  is  implemented  using  various  classifiers,  namely,  k-nearest  neighbor  classifier  (KNNC), 
Parzen  density  based  classifier  (PAR),  Automatic  neural  network  classifier  (NEURC),  Normal  densities  based  linear 
classifier  (LDC),  Nearest  mean  classifier  (NMC),  Scaled  nearest  mean  classifier  (NMSC),  Normal  densities  based 
quadratic  (multi-class)  classifier  (QDC),  Uncorrelated  normal  densities  based  quadratic  classifier  (UDC).  The  more 
details  information  on  these  classifiers  can  be  found  in  [26]. 

We  evaluated  the  malignancy  differentiation  capability  of  the  following  individual  and  combined  features: 

FI:  ABV 

F2  :  ADeoxy 

F3  :  Tumor  Size,  (S) 

F1-F2:  ABV  and  ADeoxy 
F1-F2-F3:  ABV,  ADeoxy,  and  S 
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TABLE  I 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F1-F2-F3:  ABV ,  ADeoxy,  S 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9098 

0.9041 

0.9017 

0.8984 

0.8864 

0.8843 

0.8807 

0.8752 

TABLE  II 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F1-F2:  ABV,  ADeoxy 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.9001 

0.8993 

0.8930 

0.8908 

0.8992 

0.8821 

0.8782 

0.8645 

The  evaluation  is  based  on  receiver  operating  characteristics  (ROC)  methodology.  The  ROC  curve  is  obtained 
by  plotting  the  probability  of  false  positive  versus  the  probability  of  detection.  The  evaluation  of  the  classification 
methods  is  based  on  the  area  under  the  ROC  curve  (AUC).  The  best  performing  feature  set  is  the  combination  of 
the  three  features.  Table  1  gives  the  AUC  values  for  8  different  classifiers  for  all  three  features.  The  scaled  nearest 
mean  classifier  has  the  best  performance  in  terms  of  classification  with  a  AUC  value  of  0.9098.  Table  2  gives 
the  AUC  values  for  8  different  classifiers  for  the  features  ABV  and  ADeoxy.  The  scaled  nearest  mean  classifier 
has  the  best  performance  in  terms  of  classification  with  a  AUC  value  of  0.9001.  Table  3  gives  the  AUC  values 
for  8  different  classifiers  for  the  feature  ABV.  The  nearest  mean  classifier  has  the  best  performance  in  terms  of 
classification  with  a  AUC  value  of  0.8832.  Table  4  gives  the  results  8  different  classifiers  for  the  feature  ADeoxy. 
The  nearest  mean  classifier  has  the  best  performance  in  terms  of  classification  with  a  AUC  of  0.8790. 

Figures  3  and  4  show  the  distribution  of  features  ABV  and  ADeoxy  extracted  from  benign  and  malignant 
tumors.  The  thresholds  computed  using  scaled  nearest  mean  classifier  and  Parzen  classifiers.  In  Fig.  5,  ROC  curves 


TABLE  III 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  FI:  ABV 


Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8817 

0.8764 

0.8807 

0.8779 

0.8513 

0.8778 

0.8832 

0.8302 

TABLE  IV 

AUC  VALUES  FOR  DIFFERENT  CLASSIFIERS  FOR  F2:  ADeoxy 

Type 

NMSC 

PAR 

LDC 

UDC 

NEURC 

QDC 

NMC 

KNNC 

AUC 

0.8787 

0.8764 

0.8776 

0.8711 

0.8491 

0.8613 

0.8790 

0.8331 
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Fig.  20.  Scaled  Nearest  Mean  Classifier  and  F1-F2  2-D  data  clustering. 


Fig.  21.  Parzen  and  F1-F2  2-D  data  clustering. 

for  all  of  the  features  and  the  best  two  features,  namely,  ADeoxy  and  ABV  are  plotted  using  scaled  nearest 
mean  classifier.  The  observed  area  under  the  ROC  curve  for  all  the  features  and  F1-F2  are  0.9098  and  0.9001, 
respectively.  In  Fig.  6,  ROC  curves  for  individual  features  are  plotted  using  nearest  mean  classifier.  The  observed 
area  under  the  ROC  curve  for  ADeoxy  and  ABV  are  0.8832  and  0.8790,  respectively. 
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Fig.  22.  ROC  curves  for  F1-F2-F3  and  F1-F2  using  NMSC  Classifier. 


Fig.  23.  ROC  curves  for  FI  and  F2  using  NMC  Classifier. 
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III.  Key  Research  Accomplishments 

1)  We  developed  a  new  adaptive  discretization  method  based  on  MR  a  priori  information  for  diffuse  optical 
image  reconstruction. 

2)  Born  approximation  based  linearization  is  widely  used  in  practical  NIR  image  reconstruction  algorithms.  We 
derived  the  Born  expansion  and  Frechet  derivatives  of  the  coefficient-to-measurement  map  in  diffuse  optical 
tomography  and  we  analyzed  the  error  due  to  order  Born  expansion  in  diffuse  optical  tomography. 

3)  We  completed  the  work  on  two-level  domain  decomposition  methods  for  diffuse  optical  tomography  and 
submitted  as  a  journal  paper.  The  work  is  accepted  to  appear  in  Inverse  Problems,  and  Institute  of  Physics 
Journal. 

4)  We  developed  a  direct  reconstruction  method  for  spatially  resolved  pharmacokinetic  rate  images  of  indocyanine 
green  and  applied  this  method  to  human  breast  data.  This  work  is  now  being  prepared  as  a  journal  publications. 

5)  We  performed  an  ROC  analysis  of  NIR  parameters  collected  from  116  patients  using  a  handheld  NIR 
spectroscopy  instrument  and  compared  the  discrimination  capability  of  different  NIR  parameters  using  various 
statistical  classifiers  and  ROC  methodology.  This  work  is  now  being  prepared  as  a  journal  publication. 

6)  We  collected  concurrent  MR-NIR  tomographic  data  and  ICG  data  from  10  patients.  We  are  currently  pro¬ 
cessing  this  data. 


IV.  Reportable  Outcomes 

Complete  list  of  reportable  outcomes  is  given  below: 

1)  B.  Alacam,  B.  Yazici,  X.  Intes,  B.  Chance,  ’’Extended  Kalman  filtering  for  the  modeling  and  analysis  of 
ICG  pharmacokinetics  in  cancerous  tumors  using  NIR  optical  methods,”  to  appear  in  IEEE  Transactions  in 
Biomedical  Engineering. 

2)  B.  Alacam,  B.  Yazici,  X.  Intes,  B.  Chance,  ’’Analysis  of  ICG  pharmacokinetics  in  cancerous  tumors  using 
NIR  optical  methods”  in  Proc.  of  EMBS-llth  Anniversary  Conference,  September,  2005,  Shanghai,  China, 
pp.  62  -  65. 

3)  B.  Alacam,  B.  Yazici,  X.  Intes,  S.  Nioka,  B.  Chance,  ’’Spatially  resolved  pharmacokinetic  rate  images  of  ICG 
using  Near  Infrared  optical  methods”,  Proc.  of  2006  SPIE  Photonic  West,  San  Jose,  California  USA,  21-  26 
January  2006,  vol.  6088.  pp.  455-464. 

4)  B.  Alacam,  B.  Yazici,  X.  Intes,  S.  Nioka,  B.  Chance,  ’’Direct  reconstruction  of  pharmacokinetic  rate  Images 
of  Indocyanine  Green  in  fluorescence  molecular  tomography”  Proceedings  of  Biomedical  Optics  Topical 
Meeting,  Fort  Lauderdale,  Florida,  USA  ,  19-22  March,  2006,  In  print. 
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5)  B.  Alacam,  B.  Yazici,  A.  Serdaroglu,  X.  Intes,  B.  Chance,  ’’Reconstruction  of  spatially  resolved  pharmacoki¬ 
netic  rate  images  of  fluorescence  agents  in  FDOT”  to  appear  in  Proc.  of  EMBS-28th  Anniversary  Conference, 
September,  2006,  New  York. 

6)  B.  Alacam,  B.  Yazici,  X.  Intes,  B.  Chance,  ’’Direct  reconstruction  of  spatially  resolved  pharmacokinetic  rate 
images  of  fluorescence  agents”  to  he  submitted  to  IEEE  Transactions  in  Medical  Imaging. 

7)  K.  Kwon  and  B.  Yazici,  ”Two-level  domain  decomposition  methods  for  diffuse  optical  tomography,”  to  appear 
in  Inverse  Problems. 

8)  M.  Guven,  B.  Yazici,  K.  Kwon,  E.  Giladi,  X.  Intes,  ’’Effect  of  Discretization  error  in  diffuse  optical  absorption 
imaging,”  submitted  to  Inverse  Problems. 

9)  K.  Kwon  and  B.  Yazici,  ’’Born  expansion  and  Erechet  derivatives  in  diffuse  optical  tomography,”  submitted 
to  Inverse  Problems. 

10)  M.  Guven,  B.  Yazici,  K.  Kwon,  E.  Giladi,  X.  Intes,  ’’Effect  of  Born  Approximation  in  diffuse  optical  absorption 
imaging,”  in  preparation. 

1 1)  M.  Guven,  B.  Yazici,  X.Intes,  B.  Chance,  ”A  hierarchical  Bayesian  formulation  for  diffuse  optical  tomography 
with  a  priori  anatomical  information”,  in  the  proceedings  of  ASIEOMAR  Conference,  Pacific  Grove,  October 
30-November  2,  2005. 

12)  M.  Guven,  B.  Yazici,  X.Intes,  B.  Chance,  ’’Diffuse  optical  tomography  with  a  priori  anatomical  information”, 
in  the  proceedings  of  Applied  Imagery  Pattern  Recognition  (AIPR)  Workshop  Emerging  Technologies  and 
Applications  for  Imagery  Pattern  Recognition,  Washington,  DC,  USA,  October  19-21. 

13)  M.  Guven,  K.  Kwon,  B.  Yazici,  E.  Giladi,  X.  Intes,  ”A  Priori  Interpolation  Error  Based  Mesh  Generation  for 
Diffuse  Optical  Tomography”  in  the  proceedings  of  Applied  Imagery  Pattern  Recognition  (AIPR)  Workshop 
Emerging  Technologies  and  Applications  for  Imagery  Pattern  Recognition,  Washington,  DC,  USA,  October 
19-21. 


V.  Conclusions 

In  the  last  12  months,  our  collaborator.  Dr.  Britton  Chance  at  University  of  Pennsylvania  has  provided  116 
patient  data  collected  using  the  handheld  spectrometers.  This  data  set  is  not  tomographic,  however,  it  is  sufficiently 
informative  in  understanding  the  value  of  the  stand  alone  NIR  features  for  breast  cancer  diagnosis.  We  have  analyzed 
this  data  using  the  ROC  methodology  and  demonstrated  that  stand-alone  NIR  features  yield  a  high  AUC  (0.9  or 
above)  for  cancer  diagnosis.  We  are  in  the  process  of  documenting  our  results  as  a  journal  paper. 


We  have  developed  a  direct  pharmacokinetic  rate  imaging  method  for  ICG.  We  have  reported  our  research  in 
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three  conference  papers.  We  are  currently  in  the  process  of  documenting  our  research  as  a  journal  paper. 

We  have  extended  the  two  level  domain  decomposition  methods  and  published  it  as  a  journal  article.  We  have 
developed  adaptive  discretization  methods  for  diffuse  optical  tomography  based  on  a  priori  information  provided 
by  MR  images  and  submitted  this  work  as  a  journal  paper.  In  practice,  diffuse  optical  image  reconstruction  is 
performed  typically  using  linear  forward  model  based  on  Born  approximation.  We  analyzed  the  error  in  diffuse 
optical  image  reconstruction  due  to  Born  approximation  and  submitted  our  results  as  a  journal  paper. 

Dr.  Chance  has  also  collected  MR-NIR  concurrent  tomographic  data  and  ICG  data  from  10  patients.  We  are 
currently  in  the  process  of  processing  this  data.  We  plan  to  reconstruct  diffuse  optical  tomographic  images  with 
MR  a  priori  information.  We  will  also  apply  our  direct  reconstruction  method  to  obtain  IG  pharmacokinetic  rate 
images.  In  the  upcoming  12  months,  we  expect  to  collect  more  patient  data  and  perform  an  ROC  study  on  NIR 
features  extracted  from  tomographic  images. 
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Extended  Kalman  Filtering  for  the  Modeling  and 
Analysis  of  ICG  Pharmacokinetics  in  Cancerous 
Tumors  using  NIR  Optical  Methods 

Burak  Alacam,  Member,  IEEE,  Birsen  Yazici*,  Member,  IEEE,  Xavier  lutes,  Britton  Chance 


Abstract —  Compartmental  modeling  of  indocyanine  green 
(ICG)  pharmacokinetics,  as  measnred  by  near  infrared  (NIR) 
techniqnes,  has  the  potential  to  provide  diagnostic  information 
for  tumor  differentiation.  In  this  paper,  we  present  three  different 
compartmental  models  to  model  the  pharmacokinetics  of  ICG 
in  cancerous  tumors.  We  Introduce  a  systematic  and  robust 
approach  to  model  and  analyze  ICG  pharmacokinetics  based  on 
the  extended  Kalman  filtering  (EKF)  framework.  The  proposed 
EKT  framework  effectively  models  multiple-compartment  and 
multiple-measurement  systems  in  the  presence  of  measurement 
noise  and  uncertainties  in  model  dynamics.  It  provides  simul¬ 
taneous  estimation  of  pharmacokinetic  parameters  and  ICG 
concentrations  in  each  compartment.  Moreover,  the  recursive 
nature  of  the  Kalman  filter  estimator  potentially  allows  real 
time  monitoring  of  time  varying  pharmacokinetic  rates  and 
concentration  changes  in  different  compartments.  Additionally, 
we  introduce  an  information  theoretic  criteria  for  the  best 
compartmental  model  order  selection,  and  residual  analysis  for 
the  statistical  validation  of  the  estimates.  We  tested  our  approach 
using  the  ICG  concentration  data  acquired  from  four  Eischer  rats 
carrying  adenocarcinoma  tumor  cells.  Our  study  indicates  that, 
in  addition  to  the  pharmacokinetic  rates,  the  EKE  model  may 
provide  parameters  that  may  be  useful  for  tumor  differentiation. 

Index  Terms — Extended  Kalman  filter,  indocyanine  green, 
compartmental  analysis,  pharmacokinetics,  tumor  characteriza¬ 
tion. 


I.  Introduction 

Near  infrared  (NIR)  diffuse  optical  imaging  and  spec¬ 
troscopy  methods  provide  quantitative  functional  infor¬ 
mation  that  cannot  be  obtained  by  the  conventional  radio¬ 
logical  methods  [I]-[3].  NIR  techniques  can  provide  in  vivo 
measurements  of  the  oxygenation  and  vascularization  states, 
uptake  and  release  of  optical  contrast  agents,  and  chromophore 
concentrations  with  high  sensitivity.  In  particular,  NIR  diffuse 
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optical  techniques  in  conjunction  with  optical  contrast  agents 
have  the  potential  to  characterize  angiogenesis,  and  to  differ¬ 
entiate  between  malignant  and  benign  tumors  [4]-[7]. 

At  present,  indocyanine  green  (ICG)  is  the  only  NIR  optical 
agent  approved  for  human  use.  In  NIR  measurements,  the 
presence  of  ICG  within  an  imaging  volume  results  in  an 
increased  signal  that  can  be  observed  over  the  course  of  the 
experiment.  Study  of  the  time  kinetics  of  ICG  concentration 
curves  may  provide  physiologically  relevant  information  for 
tumor  differentiation.  Specifically,  cancerous  tissue  types  are 
expected  to  show  high  and  fast  uptake  due  to  the  proliferation 
of  ’’leaky”  angiogenetic  microvessels,  while  normal  and  fatty 
tissue  show  little  uptake. 

A  number  of  research  groups  reported  compartmental  mod¬ 
eling  of  ICG  time-kinetic  measurements  using  NIR  methods 
for  tumor  diagnosis  in  animal  and  human  subjects  [8]-[10]. 
A  compartmental  model  is  a  mathematical  description  of  the 
concentrations  of  contrast  agents  in  which  each  compartment 
represents  a  kinetically  distinct  tissue  type.  It  consists  of 
a  set  of  coupled  ordinary  differential  equations  (ODE)  and 
a  measurement  model.  Coefficients  of  the  ODE’s  are  the 
physiological  parameters  of  interest  that  represent  rates  of 
exchange  between  different  compartments.  These  parameters 
are  non-linearly  related  to  the  total  concentration  of  ICG 
measured  by  NIR  methods.  Eurthermore,  concentration  of 
ICG  in  each  compartment  cannot  be  directly  measured  non- 
invasively  by  NIR  techniques,  making  the  pharmacokinetic 
parameter  estimation  a  highly  non-linear  problem. 

Current  methods  of  ICG  compartmental  modeling  involve 
curve  fitting  methods  and  various  techniques  for  solving  differ¬ 
ential  equations.  Gurfinkel  et  al.  presented  a  two-compartment 
model  for  ICG  kinetics  and  estimated  model  parameters  [8]. 
The  measurements  were  obtained  using  a  frequency  domain 
photon  migration  system  coupled  with  a  charge-coupled  de¬ 
vice.  The  pharmacokinetic  parameters  were  estimated  for  each 
pixel  based  on  a  curve  fitting  method.  This  study  indicated  that 
model  parameters  show  no  difference  in  the  ICG  uptake  rates 
between  normal  and  diseased  tissue.  Cuccia  et  al.  presented 
a  study  of  the  dynamics  of  ICG  in  an  adenocarcinoma  rat 
tumor  model  [9].  A  two-compartment  model  describing  the 
ICG  dynamics  was  used  to  quantify  physiologic  parameters 
related  to  capillary  permeability.  The  ICG  concentration  curves 
were  fitted  to  the  compartmental  model  using  a  non-linear 
least  squares  Levenberg-Marquart  algorithm.  It  was  shown 
that  different  tumor  types  have  different  capillary  permeability 
rates.  Intes  et  al.  presented  the  uptake  of  ICG  by  breast  tumors 
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using  a  continuous  wave  diffuse  optical  tomography  apparatus 
[10].  A  two-compartment  model  was  used  to  analyze  the 
pharmacokinetics  of  ICG.  A  curve  fitting  algorithm,  namely 
the  non-linear  Nelder-Mead  simplex  search,  was  used  to 
estimate  the  pharmacokinetic  parameters.  This  study  showed 
that  the  malignant  cases  exhibit  slower  rate  constants  (uptake 
and  outflow)  as  compared  to  healthy  tissue. 

While  the  studies  described  above  demonstrate  the  feasi¬ 
bility  of  the  ICG  pharmacokinetics  in  tumor  characterization; 
due  to  the  highly  non-linear  nature  of  the  pharmacokinetic 
parameter  estimation,  variation  in  parameter  values  from  one 
subject  to  another,  and  sparse  data  available  in  clinical  and 
laboratory  settings,  a  systematic  and  robust  approach  is  needed 
to  model,  estimate  and  analyze  ICG  pharmacokinetics.  Such 
an  approach  must  include:  i)  a  method  for  compartmental 
model  order  selection,  ii)  a  robust  method  of  estimating  ICG 
pharmacokinetic  parameters,  and  in)  a  method  of  validating 
the  selected  model  and  the  estimation  results. 

In  this  paper,  we  present  three  different  compartmental 
models  for  the  ICG  pharmacokinetics  in  cancerous  tumors 
and  propose  an  extended  Kalman  filtering  (EKF)  framework 
to  estimate  the  model  parameters.  The  models  capture  the 
transportation  of  ICG  between  the  vascular  and  extravascular 
compartments,  including  interstitial  fluid  region,  parenchymal 
cell,  intracellular  binding  site,  and  extravascular,  extracellular 
spaces  (EES).  The  extended  Kalman  filter  (EKF)  is  a  recursive 
modeling  and  estimation  method  with  numerous  advantages 
in  ICG  pharmacokinetic  modeling.  These  include:  i)  effective 
modeling  of  multiple  compartments,  and  multiple  measure¬ 
ment  systems  governed  by  coupled  ordinary  differential  equa¬ 
tions,  in  the  presence  of  measurement  noise  and  uncertain¬ 
ties  in  the  compartmental  model  dynamics;  ii)  simultaneous 
estimation  of  pharmacokinetic  model  parameters  and  ICG 
concentrations  in  each  compartment,  which  are  not  accessible 
in  vivo  by  means  of  NIR  techniques;  Hi)  recursive  estimation 
of  time-varying  pharmacokinetic  model  parameters;  iv)  statisti¬ 
cal  validation  of  estimated  concentrations  and  error  bounds  on 
the  pharmacokinetic  parameter  estimates;  v)  incorporation  of 
available  a  priori  information  about  the  initial  conditions  of  the 
permeability  rates  into  the  estimation  procedure;  vi)  potential 
real-time  monitoring  of  ICG  pharmacokinetic  parameters  and 
ICG  concentrations  in  different  compartments  due  to  the 
recursive  nature  of  the  EKF  estimation  method.  Additionally, 
we  present  a  method  for  selecting  the  optimal  compartmental 
model  order  based  on  a  Bayesian  information  criterion,  and  a 
statistical  validation  method  based  on  residual  analysis. 

We  test  our  approach  using  the  ICG  concentration  data 
acquired  from  four  Fisher  rats  carrying  adenocarcinoma  tumor 
cells.  Two-,  three-  and  four-compartment  models  are  fitted  to 
data  and  pharmacokinetic  model  parameters  and  concentra¬ 
tions  in  different  compartments  are  estimated  using  the  EKF 
framework.  The  Bayesian  information  criterion  suggests  that 
the  two-compartment  model  provides  a  sufficient  fit  for  our 
data.  The  estimated  model  order  and  the  model  parameters  are 
further  validated  by  residual  analysis.  The  model  parameters 
are  used  to  differentiate  between  two  types  of  cancerous 
tumors.  Our  study  suggests  that  the  permeability  rates  out  of 
the  vasculature  are  higher  in  edematous  tumors  as  compared 


to  necrotic  tumors.  Additionally,  we  observe  that  in  the  two- 
compartment  model,  the  ICG  concentration  curve  is  higher 
in  the  EES  compartment  in  edematous  tumors.  This  suggests 
that  the  ratio  of  the  peak  value  of  the  ICG  concentrations 
in  different  compartments  may  be  a  useful  parameter  to 
differentiate  tumors. 

The  paper  is  organized  as  follows:  In  Section  II,  we  present 
the  two-,  three-  and  four-compartment  models  for  ICG  phar¬ 
macokinetics  in  tissue.  In  Section  III,  we  present  the  state- 
space  representation  of  the  compartmental  models;  estimation 
of  ICG  pharmacokinetic  parameters  and  ICG  concentrations 
in  the  EKF  framework;  and  an  optimal  model  order  selection 
criterion.  In  Section  IV,  we  present  the  experimental  results 
obtained  from  Fischer  rat  data.  Section  V  summarizes  our 
results  and  conclusion.  Appendix  I  includes  the  derivation 
of  the  likelihood  function  used  in  the  Bayesian  information 
criterion. 

II.  ICG  Pharmacokinetic  Modeling  Using  NIR 
MEASUREMENTS 

A.  Indocyanine  Green 

ICG  is  an  optical  dye  commonly  used  in  retinopathy  and 
hepatic  diagnostics.  Given  its  low  toxicity  and  FDA  approval, 
it  has  recently  been  utilized  as  a  blood  pooling  agent  for  the 
detection  and  diagnosis  of  cancerous  tumors  by  means  of  NIR 
optical  methods.  The  absorption  peak  of  ICG  is  805  nm  and 
the  fluorescence  peak  is  at  830  nm.  ICG  has  strong  affinity 
for  blood  proteins.  In  plasma,  ICG  is  near-completely  bound, 
primarily  to  albumin.  As  a  result,  its  in  vivo  kinetics  are  similar 
to  those  of  a  70  kD  molecule,  although  it  has  a  molecular 
weight  of  about  700  D  [11]-[15]. 

ICG  is  eliminated  from  the  body  primarily  through  the 
bile.  Outside  of  the  circulatory  system,  it  is  not  available  for 
removal  until  it  returns  to  the  system.  The  kinetics  of  this 
transition  offers  a  potential  means  of  non-invasively  assessing 
the  leakiness  of  large  molecules  from  the  microvasculature; 
this  permeability  is  a  characteristic  of  the  poorly  developed 
vasculature  observed  in  angiogenesis.  The  increase  in  local 
microvasculature  density  is  also  expected  to  induce  increased 
perturbation  in  the  optical  signal  from  intercapillary  ICG. 

There  are  some  differences  in  the  delivery  of  ICG  between 
normal  and  cancerous  vasculature.  In  normal  tissue,  ICG  acts 
as  a  blood  flow  indicator  in  tight  capillaries  of  normal  vessels. 
However  in  tumors,  ICG  may  act  as  a  diffusible  (extravascular) 
flow  in  the  leaky  capillary  of  cancer  vessels.  To  investigate 
the  validity  of  this  hypothesis,  one  has  to  employ  at  least 
a  two-compartment  model  composed  of  plasma  and  EES. 
Additionally,  the  permeability  rate  is  expected  to  increase  as 
the  malignancy  advances  [9],  [10].  Fig.  1  (a)  and  (b)  illustrates 
the  ICG  flow  for  healthy  and  malignant  tissue,  respectively. 

B.  Compartmental  Analysis  of  ICG  Pharmacokinetics 

Compartmental  modeling  allows  relatively  simple  and  ef¬ 
fective  mathematical  representation  of  complex  biological  re¬ 
sponses  due  to  contrast  agents.  A  region  of  interest  is  assumed 
to  consist  of  a  number  of  compartments,  generally  representing 
a  volume  or  a  group  of  similar  tissues  into  which  the  contrast 


IEEE  TRANSACTIONS  ON  BIOMEDICAL  ENGINEERING 


3 


Fig.  1.  An  illustration  of  the  ICG  flow  (a)  in  tight  capillary  of  normal  vessel, 
(b)  in  permeable  capillary  of  tumor  tissue. 


Fig.  2.  A  simple  illustration  of  the  capillary  extracapillary  structure. 


agent  is  distributed.  The  concentration  change  in  a  specific 
compartment  is  modeled  as  a  result  of  the  exchange  of  contrast 
agent  between  connected  compartments.  These  changes  are 
modeled  by  a  collection  of  coupled  ODEs;  each  equation 
describing  the  time  change  dictated  by  the  biological  laws  that 
govern  the  concentration  exchanges  between  the  interacting 
compartments  [16]-[19].  In  this  work,  we  investigate  three 
different  compartmental  models  for  the  ICG  kinetics  and  deter¬ 
mine  the  optimal  model  order  based  on  Bayesian  information 
criteria. 

1 )  The  four-compartment  model:  Fig.  2  illustrates  the  cap¬ 
illary  and  extracapillary  space  relevant  to  the  four  compart¬ 
ment  model.  The  four-compartment  model  includes  capillary 
region,  interstitial  fluid  region,  parenchymal  cell  region  and 
intracellular  binding  site  as  compartments  [20].  The  ICG, 
injected  intravenously  into  the  subject,  can  pass  from  the 
capillary  into  the  reversible  binding  site  inside  the  cell  through 
the  interstitial  fluid  region  and  the  parenchymal  cell  region 
[20]-[22].  Moreover,  in  advanced  tumor  stages,  the  leakiness 
around  the  tumor  vessels  is  expected  to  increase,  resulting  in 
higher  permeability  rates  during  the  transportation  of  ICG  into 
the  compartments.  A  block  diagram  of  the  four-compartment 
transport  and  chemical  model  of  ICG  delivery  is  shown  in  Fig. 
3(a). 

Fet  Cp,  Ci,  Cpc,  Cb  denote  the  ICG  concentrations  in 
plasma,  the  interstitial  fluid  region,  the  parenchymal  cell  region 
and  the  intracellular  binding  site,  respectively;  and  let 
k^\  kf'b'’ ,  and  be  the  constants  used  as 

equilibrium  coefficients  as  shown  in  Fig.  3(a).  Then  the  set  of 
differential  equations  representing  the  ICG  transition  between 
the  four  compartments  is  given  as  follows: 

The  leakage  into  and  the  drainage  out  of  plasma: 

=  fcWa(f)  -  fcWCp(f)  -  k^lcp{f).  (1) 

The  leakage  into  and  the  drainage  out  of  the  interstitial  fluid 


(a) 

Fig.  3.  Block  diagrams  of  the  (a)  four-compartment,  (b)  three-compartment, 
(c)  two-compartment  models  for  the  ICG  pharmacokinetics. 


region: 

+  (2) 

The  leakage  into  and  the  drainage  out  of  the  parenchymal 
cell: 

=  fc<*>a(i)  -  kfc,c(t)  -  +  kfctit)- 

(3) 

The  leakage  into  and  the  drainage  out  of  the  intracellular 
binding  site: 

=  k^^^Cp,{t)  -  kfCb{t).  (4) 

Physiologically,  the  equilibrium  constants  are  defined  by  the 
permeability  surface  area  products  given  as  PSp,  where  P  is 
the  capillary  permeability  constant,  S  is  the  capillary  surface 
area,  and  p  is  the  tissue  density,  k^'^f  is  proportional  to  the  flow 
rate  into  and  out  of  the  capillary  and  k^\  k^\  k^'^  ,  k'^f^ , 
ki^\  and  k^^'^  represent  intra-tissue  physiologic  effects  during 
ICG  delivery  from  the  capillary  to  the  binding  site.  Note  that 
the  superscript  denotes  the  order  of  the  compartmental  model. 

The  actual  bulk  ICG  concentration  in  the  tissue  measured 
by  NIR  spectroscopy,  m{t),  is  a  linear  combination  of  the  ICG 
concentrations  in  the  four  different  compartments. 

m{t)  =  v^p'^Cp{f)  +  vl^^'’Ci{t)  +  v^'^JCpcit)  +  (5) 

where  v^\  Vpt\  are  volume  fractions  of  plasma, 
the  interstitial  fluid  region,  the  parenchymal  cell  region  and 
the  intracellular  binding  site,  respectively. 

2)  The  three-compartment  model:  In  this  model,  the 
parenchymal  cell  and  intracellular  binding  site  compartments 
are  combined  to  form  a  single  compartment  called  parenchy¬ 
mal  cell.  This  amounts  to  the  assumptions  that  the  transport 
of  ICG  into  the  intracellular  binding  site  is  negligible  as 
compared  to  the  other  compartments,  and  therefore  omitted 
from  the  model.  A  block  diagram  of  the  three-compartment 
transport  and  chemical  model  of  ICG  delivery  is  shown  in  Fig. 
3(b).  The  three-compartment  transport  equations  are  given  as 
follows: 
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The  leakage  into  and  the  drainage  out  of  plasma: 

(6) 

The  leakage  into  and  the  drainage  out  of  the  interstitial 
space: 

+  (7) 

The  leakage  into  and  the  drainage  out  of  the  parenchymal 
cell; 

-  k‘f'>CpS)  (8) 

The  total  ICG  concentration  measured  by  NIR: 

m{t)  =  v^^'>Cp{t)  +  wf  ^C'i(f)  +  v^p^Cpc{t)  (9) 

where  Vp^\  v^J  and  Cp,  Ci,  Cpc  are  as  defined  in  the 
four-compartment  model. 

3)  The  two-compartment  model:  In  the  two-compartment 
model,  the  tumor  region  is  assumed  to  be  composed  of  two 
compartments,  namely  the  plasma  and  the  extra-cellular  extra- 
vascular  space  (EES)  [9],  [23],  [24].  The  EES  is  defined  as 
the  region  that  lies  outside  of  both  the  vascular  region  and 
the  tumor  cells.  The  transport  of  the  ICG  to  the  third  and 
fourth  compartments  are  assumed  to  be  negligible.  Therefore 
the  last  two  compartments  in  the  four  compartment  model  is 
omitted.  We  consider  transcapillary  leakage  to  occur  only  at 
the  tumor  site.  We  also  assume  that  a  small  perturbation  of  the 
global  plasma  concentration  does  not  affect  the  bulk  removal. 
Eig.  3(c)  shows  the  block  diagram  of  the  two-compartment 
model  for  the  ICG  kinetics.  Let  Cp  and  denote  the  ICG 
concentrations  in  plasma  and  the  EES,  respectively.  Then  the 
two-compartment  ICG  chemical  transport  equations  are  given 
as  follows: 

The  leakage  into  and  the  drainage  out  of  plasma: 

=  fcf  a(f)  -  -  fcricp(f).  (10) 

The  leakage  into  and  the  drainage  out  of  the  EES: 

=  (11) 

The  parameters  and  govern  the  leakage  into  and 

("2) 

the  drainage  out  of  the  EES,  respectively.  The  parameter 
describes  the  ICG  elimination  from  the  body  through  kidneys 
and  liver. 

Actual  bulk  ICG  concentration  in  the  tissue  measured  by 
NIR  is  a  linear  combination  of  plasma  and  EES  ICG  concen¬ 
trations  given  by: 

m{t)  =  v^^'>Cp{t)  -f  (12) 

(2)  (2) 

where  the  parameters  Vp  and  Ve  denote  the  plasma  and  EES 
volume  fractions,  respectively. 

III.  Extended  Kalman  Eiltering  for  the  ICG 
Pharmacokinetics 

Eor  the  rest  of  our  discussion,  we  shall  use  the  explicit  form 
of  the  two-compartment  model  as  a  running  example  to  clarify 
our  notation.  Note  that  for  the  rest  of  the  paper,  all  matrices 
and  vectors  will  be  in  boldface  and  scalar  quantities  will  be 
in  non-boldface  notation. 


A.  State-space  Representation  of  the  ICG  Pharmacokinetics 

Coupled  differential  equations  resulting  from  the  two- 
compartment  modeling  of  the  ICG  pharmacokinetics  can  be 
expressed  in  state-space  representation  as  follows: 


■  dCS)  1 

7,(2) 

i^a 

■  cs) 

+c®(f) 

.  dCp{f)  J 

7,(2) 

-{ka^  +  k‘'ol)  _ 

Cp{t) 

(13) 

m{f)  = 

(2)  1  Ceit) 

^2  J  [  Cp{t) 

+  77(f) 

where  dB{t)  is  the  Wiener  process  increment,  dB{t)  =  U3{f)dt. 
Here,  a;(f)and  77(f)  can  be  thought  of  as  uncorrelated  zero 
mean  Gaussian  processes  with  covariance  matrix  Q,  and 
variance  cr^,  respectively. 

In  vector-matrix  notation,  the  continuous  time  state-space 
representation  for  the  n— compartment  model  is  given  by: 

dC{f)  =  K{a.n)C(f)dt  -\-  dB{t), 
m(t)  =\{o:„)C{t) -\-r]{t).  (14) 

In  (14),  C(f)  denotes  the  concentration  vector;  K(an)  is  the 
system  matrix,  V(q;„)  is  the  measurement  matrix  and  is 
the  parameter  vector  whose  elements  are  the  pharmacokinetic 
constants  and  the  volume  fractions  for  the  n— compartment 
model.  Eor  example  the  parameter  vector  0:2  for  the  two- 
compartment  model  is  given  by 

0:2  =  [ki^^  kill  (15) 

The  ICG  measurements  in  (14)  are  collected  at  discrete  time 
instances,  t  =  kT,  k  =  0,1,...,  where  T  is  the  sampling 
period.  Therefore,  the  continuous  model  described  in  (14) 
has  to  be  discretized.  To  simplify  our  notation,  we  shall  use 
C{k)  =  C{kT)  and  mfk)  =  m{kT).  The  discrete  state  space 
system  and  the  measurement  models  are  given  as  follows: 

C{k  +  1)  =  Kd{OLn)C{k)  +  ut{k) 

m{k)  =yd{an)C{k) -\-r]{k),  (16) 

where  Kd{ctn)  =  is  the  discrete-time  system  matrix 

and  Vd{Q.n)  =  V{a.n)  is  the  discrete-time  measurement 
matrix.  u>{k)  and  ri{k)  are  zero  mean  Gaussian  white  noise 
processes  with  covariance  matrix  and  variance  cr^,  respec¬ 
tively.  Discretization  of  state-space  models  can  be  found  in 
various  system  theory  books,  see  for  example  [25]. 

An  explicit  form  of  the  discrete  state  space  model  for  the 
two-compartment  case  is  given  as  follows: 


where  Tij  is  the  row  and  column  entry  of  the  system 
matrix  Kd{ot2)-  Note  that  the  matrix  entry  is  an  exponential 
function  of  the  parameters  k^\  k^f^  and 
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To  simplify  the  estimation  process,  we  shall  first  estimate 
the  matrix  entries,  Tij,  of  the  discrete-time  system  matrix 
f^d{oLn)  and  then  compute  the  pharmacokinetic  parameters  for 
each  compartmental  model. 


B.  Modeling  of  ICG  Pharmacokinetic  Parameters  and  Con¬ 
centrations  in  an  Extended  Kalman  Filter  Framework 


The  Kalman  filter  provides  a  recursive  method  to  estimate 
the  states  in  state-space  models,  in  which  the  states  are  driven 
by  noise,  and  the  measurements  are  collected  in  the  presence 
of  measurement  noise  [26]-[28].  In  the  case  of  non-linear 
state-space  models,  the  extended  Kalman  filter  linearizes  the 
model  around  the  current  state  estimate,  and  then  applies  the 
KF  to  the  resulting  linear  model.  The  EKF  framework  is  also 
utilized  for  the  joint  estimation  of  the  unknown  system  and/or 
measurement  parameters  and  states.  In  a  linear  state-space 
model  when  both  states  and  system  parameters  are  unknown, 
the  linear  state-space  model  can  be  regarded  as  a  non-linear 
model  in  which  the  linear  system  parameters  and  states  are 
combined  to  form  the  new  states  of  the  non-linear  model. 
This  system  is  then  linearized  and  solved  for  the  unknown 
states  using  the  KF  estimator.  We  consider  a  linear  Taylor 
approximation  of  the  non-linear  model.  The  details  of  the 
linearization  procedure  and  a  general  discussion  on  EKF  can 
be  found  in  [27],  [29]-[31]. 

In  our  problem,  the  objective  is  to  simultaneously  estimate 
the  states,  i.e.,  the  ICG  concentrations  in  each  compartment, 
and  the  system  and  measurement  parameters,  i.e.,  the  pharma¬ 
cokinetic  parameters  and  the  volume  fractions.  Let  denote 
the  discrete-time  parameter  vector  of  the  pharmacokinetic  rates 
and  volume  fractions.  Eor  example,  in  the  two-compartment 
model,  62  is  given  by 


02  = 


Til  Ti2  T21  T22  Ve 


(2)  „,(2) 


P 


(18) 


Note  that  the  parameter  vector  0„,  derived  from  the  state 
space  model  (17),  is  time  independent.  In  order  to  estimate 
On  within  the  EKE  framework,  the  following  dynamic  model 
is  introduced; 


0„(fc+l)  =  0„(fc)  +  ^(fc),  (19) 

where  ^(fc)  is  a  zero  mean  white  noise  process  with  covariance 
matrix  [27].  Here,  0„(fc)  can  be  thought  of  as  the 
update  of  the  parameter  rather  than  its  value  at  time  k. 

We  append  the  parameter  vector  9n{k  -F  1)  to  the  ICG 
concentration  vector  C(/c-|-l)  to  form  the  new  non-linear  state- 
space  model  given  by 


C(A:  +  1) 

■  K(0„)C(fc)  ■ 

1 

u){k) 

0„(fc  -F  1) 

Onik) 

<i{k) 

m(fc)  =  [  Vd(0„)  0 

where  Kj/'n)  =  ^^{oLn)- 


C{k) 

Onik) 


+  V{k), 


C.  EKF  Joint  Estimation  of  ICG  Concentrations,  Pharmacoki¬ 
netic  Parameters,  and  Volume  Fractions 

In  this  section  we  will  summarize  the  major  steps  of  the 
EKE  estimator  for  the  joint  estimation  of  ICG  concentrations 
and  compartmental  model  parameters. 

Let  the  subscript  k\t  denote  the  estimate  at  time  k  given 
all  the  measurements  up  to  time  t.  Then  the  l-step  ahead 
prediction  of  the  ICG  concentrations  and  the  compartmental 
model  parameters  are  given  as  follows: 


C 

■  K(0„)C  ■ 

On 

k\k-l 

0« 

Eor  the  two-compartment  model,  (21)  becomes 


■  Ce  ■ 

TllCe  +  Ti2C'p 

= 

T2lCe  +  T22Cp 

.  02  . 

k\k-l 

02 

(22) 


The  error  covariance  matrix,  of  the  1-step  ahead 

predictions  is  given  as  follows: 


Pfc|fc-i  —  Jfc-iPfe-i|fc-iJfc-i  + 


Qd  0 

0  Sd 


(23) 


where  is  the  Jacobian  of  the  non-linear  EKE  system 
function  at  time  k.  Explicitly,  it  is  given  by: 


Ifc  = 


K{6n)  J;[K(0;)C] 


0 


I 


(24) 


k\k 


where  0  and  I  denote  zero  and  identity  matrices,  respectively. 
The  Jacobian  matrix  for  the  two-compartment  model  becomes 


i  k  — 


1 

(  tIi 

ri2  \ 

1 

\  T2I 

T22  J 

. 

0(6 

x2) 

Ce  Cp  0 


0  0  0 


0  0 


Ce  Cp  0  0 


I 


(6x6) 


-I  klk 

(25) 

where  0(gx2)  is  a  6  x  2  zero  matrix,  and  1(6x6)  is  a  6  x  6 
identity  matrix. 

The  1-step  ahead  predictions  are  updated  to  the  fc‘^-step 
estimates  by  means  of  the  Kalman  gain  matrix  which  is  given 
by 

Gk  =  Pfc|fc_iA^[APfc|fc_iA^  +  al]-\  (26) 

where  A  is  the  following  vector 


V<i(0)  ^[Vd(0)C] 


J  k\k-l 


For  the  two-compartment  model  the  A  vector  becomes 


0  0  0  0  a  c'„ 


J  k\k-l 


(27) 


(28) 


The  fc‘^-step  estimate  of  the  concentrations  and  the  param¬ 
eters  are  obtained  recursively  using 


c 

c 

0 

k\k 

0 

-F  Gk{m{k)  -  [Vd(0)C]fc|fe_i). 

k\k-l 


(29) 
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For  the  two-compartment  case,  the  A:*^-step  estimate  of  the 
concentrations  and  the  parameters  is 


■  Ce  ■ 

■  Ce  ■ 

Cp 

= 

Cp 

.  O2  . 

k\k 

.  02  . 

+Gfe[m(fc)  -  -  Vp^^^Cp)k\k-l]  (30) 

The  error  covariance  matrix,  Pk\k^  of  the  fc‘^-step  estimates 
is  updated  as 

Pfc|fc  =  [I  —  GfcA]Pfen._i,  (31) 

where  I  is  the  identity  matrix. 

In  general,  the  convergence  of  EKF  depends  on  proper 
choices  of  the  initial  values  of  the  parameters,  9,  initial  values 
of  the  concentrations,  C,  and  proper  selection  of  the  noise 
covariance  matrices  Sd,  Qd’  the  variance  [33].  The 
parameter  controls  the  convergence  of  the  Kalman  gain 
Gfc.  To  ensure  stability,  we  set  much  higher  than  the 
term  in  (26).  However,  setting  very  high  values 
of  cr^  leads  to  slow  convergence  of  the  Kalman  gain  G^.  The 
main  cause  of  divergence  in  EKF  can  be  tracked  down  to 
the  fact  that  a  change  in  the  parameter  vector  has  no  direct 
effect  on  the  Kalman  gain;  in  other  words,  there  is  no  coupling 
term  between  the  Kalman  gain  and  the  parameter  vector  [34]. 
Based  on  this  observation,  we  improved  the  convergence  of 
the  EKF  by  modifying  the  term  J(l,2)  =  ^K(0)C  in  (24), 
as  described  in  [34]. 

It  has  been  shown  that  ifQd,  Sd  and  are  selected  less  than 
the  actual  values,  it  leads  to  overconfidence  in  the  accuracy  of 
the  estimates  of  the  error  covariance  matrix  [32].  Therefore, 
we  regarded  these  matrices  as  tuning  parameters  and  not  as 
the  estimates  of  the  true  covariance  matrices,  as  suggested  in 
[32]. 

Theoretically,  the  state  estimates  can  be  initialized  at  the 
expected  value  of  the  ICG  concentrations,  i.e.  £^[€(0)].  One 
approach  to  the  initialization  of  the  parameters  is  to  utilize 
the  state-space  presentation  given  in  (16).  Since  E{m{0))  = 
Vd(0„(O))E;[C(O)],  m(O)-Vd(0„(O))E;[C(O)]  is  a  zero  mean 
random  variable.  If  we  express  the  variance  of  the  mea¬ 
surement  m(0)  in  terms  of  the  variance  of  C(0)  using  the 
measurement  model  in  (16),  and  solve  for  0„,  we  get  the 
estimate  0„(O)  as  the  most  appropriate  value  for  initialization. 
The  details  of  the  selection  of  the  initial  values  for  the 
parameters  can  be  found  in  [27]. 

The  initialization  of  the  error  covariance  matrix  is  also 
important  for  the  performance  of  the  EKF.  The  error  co- 
variance  matrix  is  the  matrix  which  provides  information 
about  the  error  bounds  for  the  estimates.  Theoretically,  the 
initial  error  covariance  matrix  is  a  diagonal  matrix  where  the 
diagonal  entries  are  the  initial  estimates  of  the  variance  of 
concentrations  and  pharmacokinetic  parameters,  i.e. 


Po|o  — 


C'ot;(C(0))  0 

0  Sd 


(32) 


In  depth  discussion  on  the  convergence  properties  of  the 
EKF  can  be  found  in  [27],  [32]-[34]. 


D.  Compartmental  Model  Order  Selection 

We  adopted  the  Bayesian  information  criterion  (BIC)  for  the 
optimal  model  order  selection.  BIC  is  a  well  known  informa¬ 
tion  theoretic  criterion,  in  which  the  optimal  model  order  is 
selected  by  minimizing  a  cost  function  to  avoid  overfitting. 
The  cost  function  depends  on  the  number  of  observations, 
the  number  of  unknown  parameters  to  be  estimated  and  the 
likelihood  function.  A  detailed  discussion  of  the  BIC  can  be 
found  in  [35]-[37]. 

In  order  to  calculate  the  BIC  for  different  compartmental 
models,  we  first  derived  a  likelihood  function  for  the  extended 
Kalman  filter.  The  derivation  is  based  on  maximum  likeli¬ 
hood  estimation  of  the  parameters  in  the  Kalman  filtering 
framework  given  as  in  [38],  [39].  We  then  modified  this 
likelihood  function  for  the  extended  Kalman  filter  estimator 
for  the  joint  estimation  of  compartmental  model  parameters 
and  concentrations.  The  details  of  the  derivation  is  provided 
in  the  Appendix  I. 

IV.  Experimental  Results  -  ICG  Pharmacokinetics 
IN  Fischer  Rat  Data 

We  applied  the  proposed  EKF  framework  to  the  pharma¬ 
cokinetic  analysis  of  ICG  data  obtained  from  four  Fischer  rats 
with  adenocarcinoma.  R3230ac  adenocarcinoma  cells  were 
injected  below  the  skin  into  four  Fischer  rats  3  weeks  prior  to 
measurements.  The  tumor  size  for  the  rats  varies  in  diameter 
from  5  to  30  mm.  Measurements  were  conducted  with  a 
combined  frequency-domain  and  steady-state  optical  technique 
that  facilitates  rapid  measurement  of  tissue  absorption.  Fre¬ 
quency  domain  measurements  were  obtained  at  674,  800, 
849,  898,  and  915  nm,  modulated  at  frequencies  from  50  to 
601  MHz,  sweeping  a  total  of  233  frequencies.  Tumors  were 
also  imaged  by  use  of  contrast-enhanced  magnetic  resonance 
imaging  and  coregistered  with  the  location  of  the  optical  probe. 
In  addition,  a  broadband  continuous  wave  reflectance  measure¬ 
ment  spanning  the  range  650-1000  nm  was  performed  with 
a  spectrometer.  With  the  reduced-scattering  coefficient  spec¬ 
trum  and  diffusion  theory,  the  broadband  reflectance  spectra 
were  converted  to  absorption  coefficient  spectra.  The  absolute 
concentration  of  ICG,  together  with  oxy-hemoglobin,  deoxy¬ 
hemoglobin,  and  water  were  calculated  by  using  multiple 
linear  regressions  of  ICG  extinction  coefficient  spectra  to  the 
calculated  absorption  spectrum  at  approximately  every  second 
for  ten  minutes.  A  detailed  discussion  of  the  measurement 
process  and  apparatus  can  be  found  in  [40],  [41]. 

Fig.  4  presents  the  ICG  concentrations  (p,M)  from  four 
different  rats.  Tumors  in  Rat  1  and  2  are  classified  as 
necrotic  because  of  their  low  tissue  oxy-hemoglobin,  low  total 
hemoglobin,  and  low  gadolinium-diethylene-triamine  penta- 
acetic  acid  (Gd-DTPA)  enhancement  levels.  Tumors  in  Rat 
3  and  4  are  classified  as  edematous  due  to  their  high  water 
content  [42].  It  can  be  observed  from  Fig.  4  that  the  necrotic 
cases  display  low  peak  ICG  concentration  values  and  slowly 
rising  slopes  unlike  the  edematous  cases  with  high  peak  values 
and  sharp  rising  slopes. 

We  estimated  the  pharmacokinetic  rates  for  the  four-,  three- 
and  two-compartment  models.  Each  data  set  has  504  mea- 
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Fig.  4.  ICG  concentrations  measured  in  tissue  for  four  different  rats. 


surements.  The  reported  parameter  estimates  are  the  asymp¬ 
totic  values  obtained  when  the  extended  Kalman  filter  has 
converged.  In  other  words,  the  predicted  parameter  values 
corresponding  to  the  final  estimate,  i.e.,  ^^^gg^^.The 

results  are  given  in  Tables  I,  II,  and  III,  respectively.  The 
error  bounds  on  the  estimates  are  derived  from  the  covariance 
matrix  of  the  EKF  estimator.  The  estimated  pharmacokinetic 
rates  for  all  compartmental  models  indicate  that  the  exchange 
rates  between  the  capillary  and  the  adjacent  compartment  (ISS 
or  EES),  n  =  2,3,4,  are  significantly  different  for 

the  necrotic  and  edematous  tissue.  We  observe  that  for  the 
four-  and  three-compartment  models,  the  estimated  exchange 
rates  between  the  ISS  and  parenchymal  cell  compartments, 
/c",  k2,  n  =  3,4,  are  comparable.  Similarly,  the  estimated  rate 
of  drainage  out  of  the  plasma,  n  =  2,3, 4,  are  consistent 
for  all  models. 

Based  on  the  model  parameter  estimates,  we  computed  the 
BIC  values  for  each  rat  data  to  reveal  overfitting.  The  BIC 
values  and  the  number  of  unknown  parameters  for  each  rat 
data  are  tabulated  in  Table  IV.  The  BIC  suggests  that  the  two- 
compartment  model  is  sufficient  for  all  four  measurement  sets. 

We  further  analyze  the  goodness-of-fit  of  the  compartmental 
models  by  means  of  residual  analysis.  The  basic  idea  of 
residual  analysis  is  to  compare  the  actual  measurements  m{k) 
with  their  l-step  ahead  predictions,  based  on 

the  estimated  parameters.  A  detailed  discussion  on  residual 
analysis  can  be  found  in  [26],  [43].  The  mean  and  variance  of 
the  residual  error  for  the  four-,  three-  and  two-compartmental 
models  are  tabulated  in  Table  5.  To  normalize  the  error  with 
respect  to  the  magnitude  of  the  actual  measurements,  we 
calculated  the  signal-to-noise  ratio  (SNR)  using  the  median 
value  of  the  measurements  and  the  mean  of  the  residual  errors 
for  each  rat  data.  As  seen  from  the  results  in  Table  6,  the 
SNR  values  are  higher  for  the  two-compartment  case  for  all 
data  sets.  These  results  show  that  the  two-compartment  model 
provides  the  minimum  bias  and  the  best  statistical  efficiency. 
Fig.  5  shows  the  measured  total  concentration  data  and  its  1- 
step  ahead  prediction  based  on  the  two-compartment  model 
for  each  rat  data.  Clearly,  there  is  a  good  agreement  between 
the  actual  and  the  predicted  values. 

Based  on  the  BIC  and  residual  analysis,  we  conclude  that 
the  two-compartment  model  provides  the  best  statistical  fit  for 


(a) 


Rats  (Edamatous) 


(d) 


Fig.  5.  ICG  concentration  measurement  data  and  1-step  prediction  of  the 
measurements  for  four  different  rats. 


the  rat  data  and  investigate  the  estimated  model  parameters  in 
more  detail. 

In  the  two-compartment  model,  the  rate  of  leakage  into  the 
EES  from  the  capillary,  k^\  range  from  0.0247  to  0.0840 
sec~^  and  the  rate  of  drainage  out  of  the  EES  and  into 
the  capillary,  kjj^\  range  from  0.0106  to  0.0777  sec~^.  Note 
that  the  permeability  rates  for  the  necrotic  cases  are  lower 
than  the  ones  observed  for  the  edematous  cases.  Additionally, 
the  estimated  values  for  the  pharmacokinetic  rates  are  much 
higher  than  the  normal  tissue  values  due  to  the  increased 
leakiness  of  the  blood  vessels  around  the  tumor  region  [9], 
[44].  The  estimated  plasma  volume  fractions  agrees  with  the 
values  reported  earlier  [9],  and  the  values  presented  in  the 
literature  [45],  [46].  These  results  confirm  that  Vp  can  be 
large  in  tumors  and  that  its  magnitude  varies  with  respect  to 
the  stage  of  the  tumor  [24].  The  estimated  values  of  the  EES 
volume  fraction,  Ve  ,  range  from  0.218  to  0.53,  in  agreement 
with  the  0.2  to  0.5  range  reported  earlier  [23].  Note  that  these 
results  are  valid  only  for  the  ICG  pharmacokinetics  in  tumor 
cells  R3230ac,  adenocarcinoma  and  may  not  be  generalized 
for  other  types  of  contrast  agents  or  tumor  types. 

Fig.  6  shows  the  estimated  ICG  concentrations  in  plasma 
and  the  EES  compartments  for  the  two-compartment  model 
for  Rats  1  to  4.  Note  that  the  concentration  curves  in  Fig. 
5  and  Fig.  6  follow  a  similar  time  course  since  the  curves 


TABLE  IV 

Test  for  model  order  selection  for  three  different 

COMPARTMENTAL  MODELS  FOR  FOUR  DIFFERENT  DATA  SETS 


Rati 

Rat2 

Rat3 

Rat4 

Model 

p 

4'bic{p) 

<t>BIc(p) 

(pBIcip) 

4>bic{p) 

Two-comp.  Model 

1 

-178.24 

-198.36 

-202.81 

-172.09 

Three-comp.  Model 

11 

-71.62 

-83.85 

-92.18 

-63.91 

Four-comp.  Model 

15 

-39.72 

-45.12 

-56.34 

-30.02 
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TABLE  I 

Four-Compartment  Model:Estimated  pharmacokinetic  parameters  using  EKE  algorithm 


{sec~^10~^) 

(sec“^10“^) 

ju(4) 

A/C 

(sec“^10“^) 

(sec“ '^10“^) 

ju(4) 

A/g 

(sec“^10“^) 

(sec“^10“^) 

^out 

(sec-llO-3 

Rat  I  (Necrotic)  1.45±0.013 

1.22±0.019 

1.86±0.017 

2.02±0.026 

2.74±0.041 

2.41±0.051 

4.05±0.059 

Rat  2  (Necrotic)  3.48±0.048 

2.77±0.034 

4.28±0.048 

4.33±0.040 

2.98±0.048 

3.03±0.061 

4.76±0.062 

Rat  3  (Edematous)  4.94it0.052 

5.16±0.067 

4.22±0.052 

4.13±0.067 

4.14±0.070 

4.27±0.078 

5.39±0.085 

Rat  4  (Edematous)  5.25±0.053 

5.31±0.063 

5.07±0.068 

5.22±0.063 

4.43±0.075 

4.03±0.072 

3.85±0.056 

TABLE  II 

Three-compartment  Model:Estimated  pharmacokinetic  parameters  using  EKE  algorithm 

1.(3) 

(sec“°10“^) 

1.(3) 

(sec-ilO-2) 

kT 

(sec“G0“^) 

r.(3) 

’'out 

(sec“^10-®) 

Rat  1  (Necrotic) 

I.93±0.061 

1.28±0.049 

I.82±0.032 

2.02±0.041 

3.89±0.052 

Rat  2  (Necrotic) 

4.41±0.074 

2.48±0.067 

4.87±0.066 

5.03±0.057 

5.45±0.071 

Rat  3  (Edematous) 

4.71±0.085 

3.88±0.077 

4.95±0.059 

4.68±0.050 

4.42±0.040 

Rat  4  (Edematous) 

5.29±0.091 

6.48±0.096 

4.48±0.062 

4.20±0.048 

5.01±0.055 

TABLE  III 

Two-compartment  Model:Estimated  pharmacokinetic  parameters  and  volume  fractions  using  EKE  algorithm 

^2) 

(sec“^10“^) 

(sec“°  10“^) 

r.(2) 

’'out 

(sec“^10“®) 

(10-2) 

Up 

(10-2) 

Rat  1  (Necrotic) 

2.47±0.043 

1.06±0.052 

4.61±0.073 

21.8±1.92 

1.41±0.053 

Rat  2  (Necrotic) 

3.54±0.082 

2.98±0.086 

4.83±0.092 

25.4±3.49 

2.42±0.088 

Rat  3  (Edematous) 

6.90±0.101 

4.93±0.072 

3.95±0.048 

30.4±2.81 

4.84±0.120 

Rat  4  (Edematous) 

8.40±0.114 

7.77±0.091 

4.02±0.068 

53.0±4.73 

7.03±0.321 

TABLE  V 

The  mean  and  variance  of  the  error  between  the  estimates  and 
measurements 


Four-compartment  Three-compartment  Two-compartment 


Mean 

Variance 

Mean 

Variance 

Mean 

Variance 

Rati 

0.0987 

7.6e-004 

0.0605 

4.7e-004 

0.0072 

2.5e-005 

Rat2 

0.1043 

9.1e-004 

0.0767 

3.0e-004 

0.0057 

4.8e-005 

Rat3 

0.1204 

8.9e-004 

0.0883 

4.9e-004 

0.0041 

3.0e-005 

Rat4 

0.0904 

5.9e-004 

0.0589 

6.8e-004 

0.0076 

8.6e-005 

provide  useful  information.  We  consider  the  ratio  of  the  peak 
concentrations  in  plasma  and  the  EES  as  a  potential  parameter 
to  discriminate  different  tumors.  The  peak  Cp/Cg  ratio  for 
Rats  1  to  4  is  0.551,  0.593,  0.787,  1.151,  respectively.  This 
ratio  is  higher  in  edematous  cases  consistent  with  the  fact  that 
ICG-albumin  leaks  more  into  the  EES  in  edematous  tumors. 
Additionally,  the  ICG  concentration  in  plasma  decays  faster 
than  the  ICG  concentration  in  the  EES  due  to  its  elimination 
through  the  liver  and  kidneys. 


TABLE  VI 

SNR  VALUES  FOR  THREE  DIFFERENT  COMPARTMENTAL  MODELS  FOR 
FOUR  DIFFERENT  DATA  SETS 


Rati 

Rat2 

Rat3 

Rat4 

Model 

SNR  (dB) 

SNR 

SNR 

SNR 

Two-compartment  Model 

73.2 

68.1 

108.3 

107.9 

Three-compartment  Model 

30.7 

36.1 

23.9 

47.0 

Four-compartment  Model 

20.8 

29.9 

27.7 

18.4 

in  Eig.  6  is  a  linear  combination  of  the  curves  in  Eig.  5. 
Note  that  initial  estimates  of  concentrations  are  noisy  due  to 
the  limited  data  used  in  the  recursive  EKE  estimation.  This 
can  be  improved  by  Kalman  backward  smoothing  [47].  The 
peak  values  of  the  plasma  concentration,  Cp,  range  from  2.72 
fiM  to  4.28  fiM.  The  absolute  value  of  the  concentrations 
may  not  be  very  useful.  However,  concentration  of  ICG  in  a 
compartment  relative  to  the  one  in  another  compartment  may 


V.  Conclusion 

In  this  paper  we  present  three  different  compartmental 
models,  an  extended  Kalman  filtering  framework  for  the 
modeling,  and  estimation  of  ICG  pharmacokinetics  in  can¬ 
cerous  tumors  based  on  NIR  measurements.  Additionally, 
we  introduce  an  information  theoretic  criterion  and  residual 
analysis  for  model  selection  and  statistical  validation.  The 
proposed  compartmental  models  are  fit  to  data  obtained  from 
Eischer  rats  with  adenocarcinoma  cells.  The  pharmacokinetic 
rates  and  volume  fractions  are  estimated  for  all  models.  The 
estimated  rates  for  all  compartmental  models  indicate  that 
the  exchange  rates  between  the  capillary  and  the  adjacent 
compartment  (ISS  or  EES)  are  significantly  larger  for  the 
edematous  tissue  as  compared  to  the  necrotic  cases.  Based 
on  the  BIG  and  residual  analysis,  we  conclude  that  the  two- 
compartment  model  provides  the  best  statistical  fit  for  the 
rat  data  and  ICG  pharmacokinetics.  Parameters  of  this  model 
indicate  that  the  permeability  rates  are  higher  for  edematous 
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Fig.  6.  ICG  concentrations  in  plasma,  Cp{t)  and  EES,  Ce{t),  for  four 
different  rats,  (a)  Rati,  (b)  Rat2,  (c)  Rat3,  and  (d)  Rat4. 

cases  as  compared  to  the  necrotic  tumors.  Additionally,  we 
estimated  the  ICG  concentrations  in  different  compartments. 
The  concentrations  in  different  compartments  may  provide 
additional  parameters  for  tissue  characterization. 

While  our  study  indicates  that  the  two-compartment  model 
provides  the  best  fit  for  the  ICG  pharmacokinetics,  the  three-  or 
four-compartment  models  may  be  advantageous  for  modeling 
the  pharmacokinetics  of  functionalized  optical  contrast  agents 
that  actively  accumulate  or  activate  in  diseased  tissue  [48]- 
[50].  In  the  near  future,  we  plan  to  analyze  the  pharmacoki¬ 
netics  of  optical  agents  within  the  framework  of  EKF  using 
data  sets  collected  from  animals  and  human  subjects. 
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Appendix  I 

The  cost  function  for  the  BIC  is  given  by 

4>bic{p)  =plnA^-21nL(0p,m(l), . ,m{N)),  (33) 

where  p  is  the  dimension  of  9p,  which  is  related  to  the  number 
of  compartments  in  the  model,  N  is  the  data  length,  and 

L(0, m(l), m(2), . ,m{N))  is  the  likelihood  function.  The 

likelihood  function  for  the  EKF  is  given  by 

1  ^ 

L{e,m{l), . ,m{N))  =  ^  ln[det(Hfc)] 

^  k^l 

(34) 

^  fc=i 

where  the  matrix  H  is  defined  as: 

Hfc  =  APfc|fc_iA^  -I-  al, 


and  cr^.  A,  and  Pfc|fc_i  are  as  defined  in  Section  III.C.  The 
vector  A  is  defined  as: 

Afc  =  m{k)  -  [Vd{e)C]kik-i,  (36) 

where  m{k)  is  the  ICG  concentration  data  collected  from 
Fisher  rats  at  time  k,  and  [Vd(0)C]fc|fe_i  is  the  1-step  ahead 
estimate  of  the  volume  fractions  and  concentrations.  The 
explicit  form  of  the  likelihood  function  for  BIC  calculation 
is  given  by 

1  ^ 

L{e,  m(l), . ,  m{N))  =  - ^  E  ln[det(APfe|fe_i  A^  +  al)] 

^  k^l 

1  ^ 

-2  E[^(*)  -  [Vd(0)C]fc|fc_i]^[APfc|fc_iA^  +  al]-^ 
k—1 

.[m{k)  -  [Vd(0)C]fe|fc_i].  (37) 

where  all  the  parameters  and  matrices  are  as  defined  in  Section 
III.C. 
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ABSTRACT 

In  this  work,  we  present  spatially  resolved  pharmacokinetic  rate  images  of  indocyanine  green  (ICG)  obtained 
from  three  breast  cancer  patients  using  near  infrared  imaging  methods.  We  used  a  two-compartment  model, 
namely,  plasma  and  extracellular  extravascular  (EES),  to  model  ICG  kinetics  around  the  tumor  region.  We 
introduced  extended  Kalman  filtering  (EKE)  framework  to  estimate  the  ICG  pharmacokinetic  rate  images.  The 
EKE  framework  allows  simultaneous  estimation  of  pharmacokinetic  rates  and  the  ICG  concentrations  in  each 
compartment.  Based  on  the  pharmacokinetic  rate  images,  we  observed  that  the  rates  from  inside  and  outside 
the  tumor  region  are  statistically  different  with  a  p-value  of  0.0001  for  each  patient.  Additionally,  we  observed 
that  the  ICG  concentrations  in  plasma  and  the  EES  compartments  are  higher  around  the  tumors  agreeing  with 
the  hypothesis  that  ICG  may  act  as  a  diffusible  extravascular  flow  in  leaky  capillary  of  cancer  vessels.  Our  study 
shows  that  spatially  resolved  pharmacokinetic  rate  images  can  be  potentially  useful  for  breast  cancer  screening 
and  diagnosis. 

Keywords:  extended  Kalman  filter,  indocyanine  green,  pharmacokinetic  analysis,  breast  cancer. 

1.  INTRODUCTION 

Near  Infrared  (NIR)  optical  imaging  has  recently  emerged  as  a  potential  diagnostic  tool  for  breast  cancer.  NIR 
imaging  has  several  advantages  over  existing  imaging  modalities.  It  is  easy  to  use,  minimally  invasive,  relatively 
inexpensive  and  can  be  made  portable.  More  importantly,  in  conjunction  with  molecular  imaging  probes,  NIR 
imaging  techniques  can  provide  cellular,  molecular  level  information,  which  can  be  used  for  cancer  imaging.  In 
particular,  dynamic  tomographic  imaging  methods  can  produce  absorption  images  which  can  be  used  to  derive 
kinetics  of  molecular  probes.  The  kinetic  rates  can  then  be  used  as  potential  detection  and  diagnosis  tools. 

Presently,  ICG  is  the  only  FDA  approved  fluorescent  agent  for  human  use.  It  is  hypothesized  that  ICG 
may  act  as  a  blood  flow  indicator  in  tight  capillaries  of  normal  vessels  and  as  a  diffusible  (extravascular)  flow 
in  leaky  capillary  of  cancer  vessels.^’ ^  A  number  of  studies  using  compartmental  modeling  were  reported 
to  show  the  feasibility  of  using  ICG  pharmacokinetics  in  tumor  characterization. Parameters  related  to 
capillary  permeability  were  used  as  malignancy  indicators.  Gurfinkel  et.  al.^  presented  in  vivo  fluorescent,  NIR 
reflectance  images  of  ICG  to  discriminate  spontaneous  canine  adenocarcinoma  from  normal  mammary  tissue 
using  a  two  compartmental  model.  It  was  claimed  that  the  model  parameters  show  no  difference  in  the  ICG 
uptake  rates  between  normal  and  diseased  tissue.  Cuccia  et  al.^  presented  a  study  of  the  dynamics  of  ICG  in 
an  adenocarcinoma  rat  tumor  model.  A  two-compartment  model  describing  ICG  dynamics  was  used  to  quantify 
physiologic  parameters  related  to  capillary  permeability.  It  was  shown  that  different  tumor  types  have  different 
capillary  permeability  rates.  Intes  et.  al.®  presented  the  uptake  of  ICG  by  breast  tumors  using  a  continuous  wave 
diffuse  optical  tomography  apparatus.  A  two-compartment  model  was  used  to  analyze  the  pharmacokinetics  of 
ICG.  It  was  shown  that  the  malignant  cases  exhibit  slower  rate  constants  (uptake  and  outflow)  compared  to 
healthy  tissue. 

The  studies  above  demonstrate  the  feasibility  of  the  ICG  pharmacokinetics  in  tumor  characterization.  How¬ 
ever,  estimation  of  pharmacokinetic  parameters  is  a  challenging  problem,  because  it  is  a  highly  non-linear  estima¬ 
tion  problem  with  sparse  data  available  in  clinical  setting.  Furthermore,  parameter  values  show  variation  from 


Further  author  information:  Send  correspondence  to  Birsen  Yazici,  e-mail:  yazici@ecse.rpi.edu 


one  subject  to  another.  To  address  these  issues,  we  recently  introduced  a  systematic  and  robust  approach  to 
model,  estimate  and  analyze  the  ICG  pharmacokinetics  based  on  extended  Kalman  filtering  framework.^’ ^  We 
introduced  Bayesian  Information  Criteria  to  choose  best  statistical  model  fit.  EKF  based  approach  can  be  also 
utilized  to  estimate  time-dependent  pharmacokinetic  rates.  We  tested  three  different  compartmental  models  for 
ICG  pharmacokinetics  using  NIR  data  collected  from  Fischer  rats  with  cancerous  tumors.  Our  study  showed  that 
the  permeability  rates  out  of  the  vasculature  are  higher  in  edematous  tumors  as  compared  to  necrotic  tumors. 

In  this  paper,  our  objective  is  to  obtain  spatially  resolved  pharmacokinetic  rate  images  of  ICG,  using  NIR 
data  acquired  from  three  breast  cancer  patients.  We  first  develop  a  set  of  temporally  resolved  2-D  ICG  con¬ 
centration  images  based  on  linearized  diffusion  equation.  We  then  estimate  the  ICG  pharmacokinetic  rates  and 
the  concentrations  in  each  compartment  for  each  pixel  based  on  the  FKF  framework  for  the  two-compartment 
model.  Reconstructed  2-D  pharmacokinetic  rate  images  of  the  three  patient  show  that  the  rates  from  inside  and 
outside  the  tumor  region  are  statistically  different  with  a  p- value  of  0.0001.  Additionally,  the  ICG  concentrations 
in  plasma  and  the  EFS  compartments  are  higher  around  the  tumor  agreeing  with  the  hypothesis  that  around 
the  tumor  region  ICG  may  act  as  a  diffusible  extravascular  flow  in  leaky  capillary  of  cancer  vessels. 

The  rest  of  the  paper  is  organized  as  follows:  In  Section  II,  we  present  the  reconstruction  of  differential 
absorption  and  ICG  concentration  images.  In  Section  III,  we  present  the  two-compartment  model  for  ICG. 
In  Section  IV,  we  present  the  EKF  based  estimation  of  ICG  pharmacokinetic  parameters.  In  Section  V,  we 
present  the  clinical  results  obtained  from  breast  cancer  patients.  Finally,  Section  VI  summarizes  our  results  and 
conclusion. 


2.  DIFFUSE  OPTICAL  TOMOGRAPHY  (DOT)  USING  ICG 

2.1.  Reconstruction  of  Bulk  ICG  Concentration  Images 

The  propagation  of  light  in  soft  tissue  is  modeled  by  diffusion  approximation.®  In  the  forward  model,  the 
analytical  solutions  of  the  heterogonous  diffusion  equation  given  in®  is  derived  using  first  order  Rytov  approx¬ 
imation.  The  sample  volume  is  divided  into  a  set  of  voxels  and  the  measurements  are  related  to  the  relative 
absorption  coefficients  of  each  voxel  by  a  system  of  linear  equations.  The  shape  of  the  breast  was  approximated 
as  a  cylinder  and  the  Kirchhoff  approximation^’  ®  for  diffuse  waves  was  used  to  model  the  interaction  of  light 
with  boundaries.  In  order  to  account  for  the  biological  noise,  the  forward  model  was  implemented  with  coupling 
coefficient  technique.®  The  inverse  problem  was  addressed  by  using  singular  value  decomposition  of  the  Moore- 
Penrose  generalized  system.  Tikhinov  regularization  was  used  to  stabilize  the  inversion  procedure.  The  L-curve 
method  was  applied  to  an  experimental  model  reconstruction  and  the  best  regularization  parameter  was  derived 
using  a  curvature  function  which  was  employed  in.^®  A  detailed  discussion  of  the  forward  and  inverse  models 
used  for  the  reconstruction  of  differential  absorption  coefficients  can  be  found  in.® 

We  used  the  linear  relationship  between  the  differential  absorption  coefficients  and  concentrations  given  by 
the  following  equation  to  reconstruct  a  set  of  temporally  resolved  2-D  ICG  concentration  images.^® 

AfXa  =  eC  InlO  =  2.303  e  C  (I) 

where  A/Za  denotes  the  differential  absorption  coefficient,  and  C  denotes  the  concentration  of  ICG,  and  e  is  the 
extinction  coefficient  of  ICG  at  wavelength  805nm.  Using  the  linear  relationship  above,  we  get  2-D  images  of 
ICG  concentrations  in  tissue  from  differential  absorption  images  reconstructed  at  different  time  instants  on  a 
pixel  by  pixel  basis.  In  the  following  Sections,  we  will  discuss  the  two-compartment  model,  and  the  estimation 
of  pharmacokinetic  rates. 

3.  ICG  PHARMACOKINETICS  AND  THE  TWO-COMPARTMENT  MODELING 

3.1.  Compartmental  Analysis  of  ICG  Pharmacokinetics 

In  this  paper,  we  assume  that  the  tumor  region  is  composed  of  two  compartments;  namely  plasma  and  the  EES.^’ 
Figure  1  shows  the  two-compartment  model  for  ICG  kinetics.  Cp  and  Ce  represent  the  ICG  concentrations  in 
plasma  and  the  EES,  respectively.  The  parameters  km  and  kout  govern  the  leakage  into  and  the  drainage  out 


Canillarv 

Cp 

keim 

kout 

kin 

EES 

Ce 

Figure  1.  Block  diagram  of  the  two-compartment  model  for  ICG  pharmacokinetics. 

of  the  EES.  The  parameter  keim  describes  the  ICG  elimination  from  the  body  through  kidneys  and  livers.  The 
parameters  Vp  and  Ve  are  plasma  and  the  EES  volume  fractions,  respectively. 

The  leakage  into  and  the  drainage  out  of  the  EES  is  given  by: 

=  -koutCeit)  +  ki^Cpit).  (2) 

The  leakage  into  and  the  drainage  out  of  plasma  is  given  by: 

- =  ~(kin  +  kelm)Cp{t)  +  koutCe(t)  .  (3) 

The  parameters  km,  kout,  and  keim  have  a  unit  of  sec~^.  They  are  defined  as  the  permeability  surface  area 
products  given  by  PSp,  where  P  is  the  capillary  permeability  constant,  S  is  the  capillary  surface  area,  and  p  is 
the  tissue  density.  We  consider  transcapillary  leakage  to  occur  only  at  the  tumor  site. 

Actual  bulk  ICG  concentration  in  the  tissue  measured  by  NIR  spectroscopy  is  a  linear  combination  of  plasma 
and  the  EES  ICG  concentrations: 

m{t)  =  VpCp{t)  +  VeCeit),  (4) 

where  Vp,  Ve,  Cg,  and  Cp  are  as  defined  above,  and  m(t)  denotes  the  bulk  ICG  concentration. 

Here,  we  consider  transcapillary  leakage  to  occur  only  at  the  tumor  site.  We  also  assume  that  a  small 
perturbation  of  the  global  plasma  concentration  does  not  affect  the  bulk  removal. 

4.  EXTENDED  KALMAN  FILTERING  FOR  THE  ESTIMATION  OF  ICG 

PHARMACOKINETICS 

Coupled  differential  equations  resulting  from  the  two-compartment  model  of  the  ICG  pharmacokinetics  can  be 
expressed  in  state-space  representation  as  follows: 

■  Ce{t) 

.  Cp{t) 


rri' 

where  u}{t)  and  r]{t)  are  uncorrelated  zero  mean  Gaussian  processes  with  covariance  matrix  Q,  and  variance  cr^, 
respectively. 

In  matrix-vector  notation,  the  state-space  representation  is  given  by: 

dC{t)  =  K{cx)C{t)dt  +  uj{t)dt, 
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m{t)  =  V{a.)C{t)  +  ri{t). 

where  C{t)  denotes  the  concentration  vector  with  elements  C'e(t),  and  Cp{t);  K(q:)  is  the  2x2  system  matrix, 
V(q;)  is  the  1x2  measurement  matrix  as  defined  in  equation  (5)  and  a  is  the  parameter  vector  given  by 

^  —  {^out  kin  kelni  '^e  '^p\' 

The  ICG  measurements  in  equation  (6)  are  collected  at  discrete  time  instances,  t  =  kT,  k  =  0,1, ...,  where  T 
is  the  sampling  period.  Therefore,  the  continuous  model  given  in  Equation  (6)  has  to  be  discretized.  To  simplify 
our  notation,  we  shall  use  C(fc)  =  C{kT)  and  m{k)  =  m{kT).  The  corresponding  discrete  state-space  model  is 
given  by: 


C{k  +  1)  =  Kd{a)C{k)  +  u;{k)  (8) 

mik)=Vd{a)C{k)  +  T]{k), 

where  Kci(a)  =  is  the  discrete  time  system  matrix  and  Vd(Q:)  =  V(a)  is  the  discrete  measurement 

matrix.  iv{k)  and  ri(k)  are  zero  mean  Gaussian  white  noise  processes  with  covariances  matrix  and  variance 
cr^,  respectively. 

An  explicit  form  for  the  discrete  time  state-space  model  is  given  as  follows: 
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m{k)  =  Ve  Vp 
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where  Tij  is  the  row  and  column  entry  of  the  system  matrix  K£;(q:).  The  matrix  entry  is  an  exponential 
function  of  the  parameters  kout  and  keim-  To  simplify  the  estimation  process,  we  shall  first  estimate  t^j’s 

and  then  compute  the  pharmacokinetic  parameters  kin,  kout  and  keim- 


We  use  the  EKF  algorithm  for  the  joint  estimation  of  the  unknown  pharmacokinetic  parameters,  kin,  kout, 
and  keim,  volume  fractions,  Ve,  and  Vp,  and  IGG  concentrations  in  plasma  and  the  EES,  Cp,  and  Cg-  In  a  linear 
state-space  model  when  both  states  (IGG  concentrations)  and  system  parameters  (pharmacokinetic  rates  and 
volume  fractions)  are  unknown,  the  linear  state-space  model  can  be  regarded  as  a  non-linear  model  in  which  the 
linear  system  parameters  and  states  are  combined  to  form  the  new  states  of  the  non-linear  model.  This  system 
is  then  linearized  and  solved  for  the  new  unknown  states  using  the  EKF  estimator.  A  detailed  discussion  of  the 
Kalman  filtering  algorithm  for  joint  estimation  of  the  pharmacokinetic  parameters  and  IGG  concentrations  in 
different  compartments  can  be  found  in.^’^ 


5.  CLINICAL  RESULTS 

5.1.  Apparatus 

In  this  work,  we  used  the  data  collected  with  a  continuous  wave  (GW)  NIR  imaging  apparatus.  The  apparatus 
has  16  light  sources,  which  are  tungsten  bulbs  with  less  than  1  watt  of  output  energy.  They  are  located  on  a 
circular  holder  at  an  equal  distance  from  each  other  with  22.5  degree  apart.  Sixteen  detectors,  namely,  silicon 
photodiodes,  are  situated  in  the  same  plane.  The  breast  is  arranged  in  a  pendular  geometry  with  the  source- 
detector  probes  gently  touching  its  surface.  Figure  2  illustrates  the  configuration  of  the  apparatus  and  the 
configuration  of  the  detectors  and  the  sources  in  a  circular  plane.  A  band  pass  filter  at  805nm,  the  absorption 
peak  of  IGG,  is  placed  in  front  of  the  sources  to  select  the  desired  wavelength.  A  set  of  data  for  one  source  is 
collected  every  500  ms.  The  total  time  for  a  whole  scan  of  the  breast  including  16  sources  and  16  detectors  is 
8.8  seconds.  The  detectors  use  the  same  positions  as  the  sources  to  collect  the  light  originating  from  one  source 
at  a  time.  Only  the  signals  from  the  farthest  11  detectors  are  used  in  the  analysis.  For  example,  when  Source  1 
is  on,  the  data  is  collected  using  detectors  4  to  14.  A  more  detailed  explanation  of  the  apparatus  and  the  data 
collection  procedure  can  be  found  in.^^ 


Figure  2.  The  cut  section  of  the  16  light  source-detector  device,  holding  a  human  breast  inside.  The  diameter  can  be 
fitted  easily.  The  16  light  source-detector  combinations  in  each  arm  are  located  equal  distance,  but  when  the  device  fits 
the  breast,  only  the  diameter  chances. 


Figure  3.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  1. 

5.2.  Protocol 

Patients  with  suspicious  breast  tumors  were  enrolled  for  this  study.  ICG  was  injected  intravenously  by  bolus  with 
a  concentration  of  0.25  mg  per  kg  of  body  weight.  Diagnostic  information  is  obtained  using  biopsy  results.  Since 
biopsy  modifies  the  blood  volume  and  blood  flow  around  the  tumor  region,  measurements  were  made  before  the 
biopsy.  Data  acquisition  started  before  the  injection  of  ICG  and  continued  for  10  minutes. 

5.3.  Tumor  Information 

Three  different  patients  with  different  tumor  types  are  included  in  this  study.  First  case.  Case  1,  is  fibroadenoma, 
which  corresponds  to  a  mass  estimated  to  be  1—2  cm  in  diameter  within  a  breast  of  9  cm  diameter.  Second  case. 
Case  2,  is  adenocarcinoma  corresponding  to  a  tumor  estimated  to  be  2—3  cm  in  diameter  within  a  breast  of  7.7 
cm  diameter.  The  third  case,  Case  3,  is  invasive  ductal  carcinoma,  which  corresponds  to  a  mass  estimated  to  be 
3—4  cm  in  diameter. 

5.4.  ICG  Concentration  Measurements  for  Pharmacokinetic  Parameter  Estimations 

Using  the  CW  imager  described  above,  for  each  patient,  sufficient  number  of  source  detector  readings  were 
collected  from  different  angles.  Reconstruction  of  differential  absorption  images  using  the  source-detector  readings 
was  discussed  in  Intes  et  ah.®  A  sample  set  of  reconstructed  differential  absorption  images  for  Case  1,  Case  2,  and 
Case  3  for  9  selected  time  instants  are  shown  in  Figure  3,  4,  and  5,  respectively.  Although  only  9  reconstructions 
are  displayed,  there  are  approximately  50  reconstructions,  each  corresponding  to  a  different  time  instant  for  each 
patient.  Here,  each  2-D  image  is  composed  of  649  pixels. 

Using  the  linear  relationship  between  ICG  concentration  and  absorption  coefficient,  we  obtained  ICG  concen¬ 
tration  images  from  differential  absorption  images  for  each  case.  A  sample  set  of  ICG  concentration  images  for 
the  selected  time  instants  is  shown  in  Figure  6,  7,  and  8  for  Cases  1,  2  and  3,  respectively.  Here,  the  concentration 
images  represent  bulk  ICG  concentration  in  the  tissue,  not  specifically  in  plasma  or  the  EES. 


Figure  4.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  2. 


Figure  5.  Differential  absorption  reconstruction  images  for  a  set  of  time  instants  for  Case  3. 
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Figure  6.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  1 
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Figure  7.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  2. 
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Figure  8.  2-D  ICG  concentration  images  for  a  set  of  time  instants  for  Case  3. 


Using  the  ICG  concentration  curves,  we  estimated  the  pharmacokinetic  parameters  for  each  pixel  based  on  the 
two-compartment  model.  We  then  constructed  2-D  permeability  rate  images  using  values  of  these  parameters. 
2-D  images  for  kin,  kout  for  two-compartmental  model  for  each  case  are  shown  in  Figures  9  (a),  (b),  10  (a),  (b), 
and  11  (a),  (b),  respectively.  We  also  analyzed  whether  the  kinetic  rates  are  statistically  different  or  not  for 
the  inside  and  outside  the  tumor  region.  The  kin  and  kout  values  from  inside  and  outside  the  tumor  region  are 
statistically  different  with  a  p-value  of  less  than  0.0001  for  all  cases. 

We  constructed  2-D  ICG  concentration  images  for  plasma  and  the  EES.  Figures  12-17  show  the  ICG  concen¬ 
tration  in  the  EES  and  plasma  for  3  different  time  instants  for  Case  1,  2,  3,  respectively.  We  observed  that  ICG 
concentrations  in  plasma  and  the  EES  compartments  are  higher  around  the  tumors  agreeing  with  the  hypothesis 
that  around  tumor  region  ICG  may  act  as  a  diffusible  extravascular  flow  in  leaky  capillary  of  cancer  vessels. 


Figure  9.  2-D  images  for  pharmacokinetic  rates  kin  and  kout  for  Case  1. 
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Figure  10.  2-D  images  for  pharmacokinetic  rates  kin  and  kout  for  Case  2. 
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Figure  11.  2-D  images  for  pharmacokinetic  rates  kin  and  kout  for  Case  3. 
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Figure  12.  2-D  ICG  concentration  images  in  plasma  for  Case  1  for  the  246.4*^,  334.4*^,  and  422.4*^  seconds. 
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Figure  13.  2-D  ICG  concentration  images  in  the  EES  for  Case  1  for  the  246.4*^,  334.4*^,  and  422.4*^  seconds. 
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Figure  14.  2-D  ICG  concentration  images  in  plasma  for  Case  2  for  the  228.8*^,  316.8*^,  and  404.8*^  seconds. 
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Figure  15.  2-D  ICG  concentration  images  in  the  EES  for  Case  2  for  the  228.8*^,  316.8*^,  and  404.8*^  seconds. 
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Figure  16.  2-D  ICG  concentration  images  in  plasma  for  Case  3  for  the  246.4*^,  378.4*^,  and  510.4*^  seconds. 
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Figure  17.  2-D  ICG  concentration  images  in  the  EES  for  Case  3  for  the  246.4*^,  378.4*^,  and  510.4*^  seconds. 


6.  CONCLUSION 

In  this  study,  we  presented  spatially  resolved  ICG  pharmacokinetic  images  together  with  the  ICG  concentrations 
in  plasma  and  the  EES  using  NIR  data  obtained  from  three  breast  cancer  patients.  We  observed  that  the  phar¬ 
macokinetic  rates  are  higher  inside  the  tumor  region  compared  to  the  outer  region;  and  the  ICG  concentrations 
in  plasma  and  the  EES  are  higher  around  the  tumor  region.  Our  study  indicates  that  pharmacokinetic  images 
can  be  potentially  useful  to  characterize  tumor  metabolism  and  angiogenesis.  In  the  near  future,  we  plan  to 
model  gadolinium  kinetics  and  compare  with  ICG  kinetics  and  biopsy  findings. 
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Abstract 

Diffuse  optical  tomography  (DOT)  in  the  near  infrared  involves  the 
reconstruction  of  spatially  varying  optical  properties  of  turbid  medium  from 
boundary  measurements  based  on  a  forward  model  of  photon  propagation. 
Due  to  the  nonlinear  nature  of  DOT,  high  quality  image  reconstruction  is  a 
computationally  demanding  problem  which  requires  repeated  use  of  forward 
and  inverse  solvers.  Therefore,  it  is  desirable  to  develop  methods  and 
algorithms  that  are  computationally  efficient.  In  this  paper,  we  develop  two- 
level  overlapping  multiplicative  Schwarz-type  domain  decomposition  (DD) 
algorithms  to  address  the  computational  complexity  of  the  forward  and  inverse 
DOT  problems.  We  use  frequency  domain  diffusion  equation  to  model  photon 
propagation  and  consider  a  nonlinear  least-squares  formulation  with  a  general 
Tikhonov-type  regularization  for  simultaneous  reconstruction  of  absorption 
and  scattering  coefficients.  In  the  forward  solver,  a  two-grid  method  is  used 
as  a  preconditioner  to  DD  to  enhance  convergence.  In  the  inverse  solver,  DD 
is  initialized  with  a  coarse  grid  solution  to  achieve  local  convergence.  We 
show  the  strong  local  convexity  of  the  nonlinear  objective  functional  resulting 
from  the  inverse  problem  formulation  and  prove  the  local  convergence  of  the 
DD  algorithm  for  the  inverse  problem.  We  provide  the  computational  cost 
analysis  of  the  forward  and  inverse  solvers  and  demonstrate  their  performance 
in  numerical  simulations. 


1.  Introduction 

Diffuse  optical  image  reconstruction  based  on  the  diffusion  equation  is  a  nonlinear  ill-posed 
problem  that  calls  for  the  use  of  nonlinear  minimization  methods  with  regularization  to  stabilize 
the  solution  [1]. 

Due  to  lack  of  analytical  solutions  for  practical  applications  with  arbitrary  geometries, 
DOT  image  reconstruction  is  often  posed  as  an  optimization  problem  involving  two  coupled 
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steps,  namely  forward  and  inverse  problems.  Each  step  consists  of  an  iterative  solver  whose 
solution  is  used  as  an  input  to  the  other  solver.  More  precisely,  the  forward  solver  computes 
the  photon  density  and  its  Jacobian  with  respect  to  the  current  optical  coefficient  estimates, 
and  the  inverse  solver  updates  the  optical  coefficients  based  on  the  output  of  the  forward  step. 
The  updated  coefficients  are  then  used  in  the  forward  solver  to  recompute  the  photon  density 
and  its  Jacobian.  As  a  result,  the  computational  complexity  of  DOT  image  reconstruction 
quickly  grows  with  the  number  of  unknowns  and  dimension.  Thus,  real  time  computation  of 
DOT  requires  numerical  techniques  to  reduce  the  complexity  of  the  problem. 

In  this  paper,  we  develop  two-level  domain  decomposition  (DD)  algorithms  to  address 
the  computational  complexity  of  the  forward  and  inverse  problems.  More  specifically,  we 
develop  an  overlapping  multiplicative  Schwarz-type  DD  algorithm  equipped  with  a  two-grid 
preconditioner  to  solve  the  forward  problem  resulting  from  the  finite  element  discretization 
of  the  frequency-domain  diffusion  equation.  For  simultaneous  absorption  and  scattering 
coefficients  reconstruction,  we  consider  a  nonlinear  least-squares  formulation  with  a  general 
Tikhonov-type  regularization.  To  solve  the  resulting  optimization  problem,  we  develop  a 
two-level  overlapping  multiplicative  Schwarz-type  DD  algorithm,  where  we  use  a  trust  region 
method  for  minimization.  Finally,  under  the  conditions  that  lead  to  the  local  strong  convexity 
of  the  nonlinear  objective  functional  considered  for  the  inverse  problem  formulation,  we  prove 
the  local  convergence  of  the  DD  algorithm  developed  for  the  inverse  problem. 

DD  methods  originate  from  the  Schwarz  alternating  procedure,  which  is  known  to  be 
the  first  DD  method  applied  for  solving  partial  differential  equations  (PDFs)  [32].  In  the  last 
two  decades,  motivated  by  the  need  for  fast  and  efficient  algorithms  for  solving  large-scale, 
three-dimensional  problems,  DD  methods  have  been  extensively  developed  and  applied  in  the 
area  of  numerical  solution  of  PDEs  [31,  33].  DD  methods  involve  partitioning  of  bounded 
domains  into  two  or  more  sub-domains,  thereby  dividing  the  original  problem  into  a  series  of 
smaller  sized  sub-problems  defined  on  the  sub-domains.  As  a  result,  DD  methods  allow  the 
parallel  solution  of  the  resulting  sub-problems,  making  the  computation  extremely  efficient. 

Owing  to  the  computational  advantages  they  offer,  DD  methods  have  been  applied 
to  inverse  problems  as  well.  In  [4-6,  36,  38],  DD  methods  were  applied  to  constrained 
convex  minimization  problems  arising  from  variational  inequalities,  where  the  term  ‘space 
decomposition’  was  introduced  for  DD  methods  in  the  context  of  optimization  problems.  In 
[5,  36,  38],  the  convergence  of  DD  methods  has  been  shown  for  convex  optimization  problems. 
DD  methods  have  also  been  applied  to  a  number  of  application  specific  inverse  problems, 
including  radon  transform  inversion  in  radar  and  x-ray  tomography  [8,  9],  geophysics  [14], 
parameter  estimation  problems  [37],  inverse  heat  conduction  problem  [29]  and  welding  and 
metal  cutting  problems  [23,  28].  In  the  area  of  DOT,  a  ‘data  driven  zonation’  method  coupled 
with  extended  Kalman  filtering  was  applied  in  conjunction  with  a  DD  method  [15],  where 
the  convergence  of  the  DD  algorithm  was  shown  empirically.  In  this  work  [15];  instead 
of  partitioning  the  main  inverse  problem  formulation,  independent  local  inverse  problems 
were  formulated  on  each  non-overlapping  sub-domain,  using  the  measurement  data  due  to 
the  source-detector  pairs  that  were  physically  present  in  the  sub-domain.  In  the  same  work, 
a  multi-grid  algorithm  was  used  to  accelerate  the  finite  difference  solution  of  the  forward 
problem;  however,  DD  methods  were  not  used.  A  similar  multi-grid-based  approach  for 
the  forward  problem  was  also  presented  in  [30].  Multi-grid  algorithms  in  the  context  of 
optimization  have  been  proposed  for  the  solution  of  the  inverse  DOT  problem  as  well  [26,  40], 
however  in  both  of  these  studies  [26,  40],  multi-grid  was  used  as  an  iterative  solver  and  DD 
methods  were  not  considered.  In  [16, 17],  a  fast  adaptive  composite-grid  (FAC)  algorithm  was 
proposed  for  the  linearized  DOT  inverse  problem,  which  can  be  viewed  as  a  fully  overlapping 
Schwarz-type  domain  decomposition  algorithm  [13],  to  provide  enhanced  spatial  resolution 
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in  a  region  of  interest.  Most  recently  in  [34],  a  multi-level  overlapping  Schwarz-type  domain 
decomposition  algorithm  was  proposed  for  the  linearized  inverse  DOT  problem,  which  was 
posed  as  a  linear  least-squares  problem  without  regularization.  In  all  these  studies  [16,  17,  34], 
DD  methods  were  not  pursued  for  the  solution  of  the  forward  problem  and  convergence  of  the 
proposed  algorithms  was  presented  empirically. 

In  this  work,  we  consider  an  overlapping  partitioning  of  the  optical  domain  to  convert 
the  finite  element  formulation  of  the  frequency-domain  diffusion  equation  into  a  number  of 
smaller-sized  sub-problems  and  use  a  multiplicative  Schwarz-type  DD  algorithm.  For  the 
numerical  solution  of  elliptic  PDFs  such  as  the  frequency-domain  diffusion  equation,  it  has 
been  shown  that  one-level  domain  decomposition  methods  are  not  efficient  in  conveying  the 
information  on  one  sub-domain  to  the  others,  which  is  attributed  to  the  rapid  decay  of  Green’s 
function  of  these  PDFs  and  the  dependence  of  the  solution  on  the  boundary  conditions  [33]. 
An  effective  way  to  address  this  issue  is  to  use  multi-grid  methods  [33].  Therefore,  we 
employ  a  two-grid  preconditioner  to  enhance  the  global  communication  of  the  multiplicative 
Schwarz-type  DD  algorithm  applied  on  the  fine  resolution  level.  We  note  that  the  multiplicative 
nature  of  the  Schwarz  algorithm  provides  higher  convergence  rate  as  compared  to  that  of  the 
additive  Schwarz  algorithms  (section  3.3).  We  refer  to  [10,  33]  for  a  detailed  discussion  about 
the  convergence  analysis  of  DD  methods  with  multi-grid  preconditioners  for  the  numerical 
solution  of  elliptic  PDFs.  For  the  inverse  problem,  we  consider  a  nonlinear  optimization 
problem  resulting  from  the  nonlinear  least-squares  formulation  with  a  general  Tikhonov-type 
regularization.  Unlike  the  approach  in  [15],  we  formulate  a  single  inverse  problem  on  the 
whole  domain  using  the  boundary  measurements  due  to  all  source-detector  pairs.  Then, 
we  apply  a  two-level  Schwarz-type  multiplicative  DD  algorithm  coupled  with  a  trust  region 
method  to  successively  minimize  the  resulting  objective  functional  on  each  overlapping  sub- 
domain.  In  this  case,  the  coarse  level  serves  to  provide  a  good  initial  guess  for  the  fine  level, 
where  the  inverse  problem  is  formulated.  Under  some  mild  conditions,  we  show  the  local 
strong  convexity  of  the  nonlinear  objective  functional.  Then,  we  prove  the  local  convergence 
of  the  DD  algorithm  for  the  nonlinear  inverse  DOT  problem  by  using  the  local  strong  convexity 
property  of  the  nonlinear  inverse  problem  formulation.  In  this  context,  the  coarse-level  solution 
used  as  the  initial  guess  to  the  fine  level  is  crucial  in  achieving  the  convergence.  Note  that  we 
show  the  convergence  properties  of  the  DD  algorithm  for  the  inverse  problem,  independent 
of  the  optimization  method.  Therefore,  the  convergence  properties  do  not  change  if  the 
trust  region  method  is  replaced  by  another  optimization  method.  We  perform  two  sets  of 
experiments  to  show  the  computational  savings  provided  by  the  proposed  DD  algorithms  as 
compared  to  non-DD  algorithms. 

The  rest  of  the  paper  is  organized  as  follows:  in  section  2,  we  define  the  forward  problem 
and  formulate  the  inverse  problem,  respectively.  In  section  3,  we  present  the  DD  algorithms 
developed  for  the  solution  of  the  forward  and  inverse  problems,  and  discuss  the  convergence 
properties  and  computational  complexity  of  the  proposed  algorithms.  Section  4  presents  the 
numerical  simulations  and  section  5  summarizes  our  results  and  conclusions.  The  paper 
includes  an  appendix  for  the  proof  of  the  local  convergence  of  the  DD  algorithm  developed 
for  the  inverse  problem. 

2.  Diffuse  optical  tomography 

2.1 .  Photon  diffusion  equation  in  frequency  domain 

We  model  the  propagation  of  light  in  biological  tissue  by  the  diffusion  equation  with  Robin 
boundary  conditions  [1].  In  frequency  domain,  the  photon  diffusion  equation  is  given  as 
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follows: 


/  10)  \ 

-V  ■(KVi>)  +  (fia  +  —)‘i>^q 


V  c  / 

O  +  2aK  —  =  0 
dv 


in 


(2.1) 


on 


where  is  a  Lipschitz  domain  in  R",  «  =  2,  3;  9 is  its  boundary,  <I>  is  the  photon  density,  c 
is  the  speed  of  light,  co  is  the  angular  frequency  of  the  source  g ,  v  is  the  unit  outward  normal 
vector  on  the  boundary,  a  is  a  parameter  to  account  for  the  the  refraction  index  mismatch  at 
the  boundary,  and  /x^ ,  /x'  and  k  =  are  the  absorption,  reduced  scattering  and  isotropic 

diffusion  coefficients,  respectively.  For  the  general  anisotropic  material,  see  [19]. 

The  unique  identification  of  the  optical  coefficients  /Xa  and  k  in  (2.1)  when  Dirichlet- 
to-Neumann  map  is  given  (or  when  infinite  sources  and  infinite  detectors  are  given)  can  be 
easily  shown  by  using  the  uniqueness  results  for  the  isotropic  case  [35].  For  the  uniqueness 
of  the  optical  coefficients  when  k  has  anisotropic  anomalous  region  contained  in  a  known 
background,  see  [18,  20,  22], 

In  [2,  3],  numerical  examples  show  that  the  Rytov  approximation  provides  better  DOT 
images  as  compared  to  the  Bom  approximation.  Therefore,  we  use  the  Rytov  measurements 
[27]  on  the  boundary: 


(2.2a) 


(2.2fi) 


Then,  the  Jacobian  of  F  with  respect  to  fia  and  /x'  is  given  as  follows: 


(2.3a) 


(23b) 


where  r  e  9f2,  O  is  the  complex  conjugate  of  <t>,  and  G  is  Green’s  function  of  (2.1). 

2.2.  Discretization  and  finite  element  method 

Suppose  that  there  are  Ns  sources  located  at  rj,  j  =  1,  . . . ,  and  Nd  detectors  located  on 
the  boundary  of  fJ,  at  ri+N,,  i  =  ,  Nd-  Let  Oj  be  the  solution  of  (2.1)  for  the  point  source 

qj (r)  —  S(r  —  rj).  Define 


L/.y  =  log(Oj(r,+A^J), 


(2.4) 


for  the  /th  detector  and  the  y  th  source. 

Consider  the  finite  element  space  spanned  by  the  bases  ut,k  —  ,  Nn-  Note  that 

for  piecewise  bilinear  element,  Nn  is  the  same  as  the  number  of  nodes.  Let  T,n  denote  the 
elements  for  m  =  ,  Ng,  where  Ng  is  the  number  of  the  elements.  Then,  the  finite  element 

formulation  for  (2.1)  in  this  finite  element  space  for  the  point  source  located  at  rj,  is  given 
as  follows: 


K  +  C+—A  = 

2a  J  ' 


(2.5) 
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where  K,  C,  A  are  N„  x  N„  matrices  and  /■'  is  a  x  1  column  vector  given  by 


Kki  — 
Cki  — 
Aki  — 


UkUi 


j  1  if  /  =  ;■ 

1 0  Otherwise, 


for  k,l  —  1,  . . . ,  Nn- 

Next,  we  dehne  the  following  function  spaces: 


e  <  \rj{r)\rea  ^  U^, 


(2.6a) 


=  {?7  €  y^’|?7  is  constant  at  each  Tm,m  —  1,  . . . ,  Ng],  (2.6b) 

where  y  is  either  /r-a  or  /r',  and  and  are  positive  constants.  Note  that  we  can  choose 
different  values  for  U  and  depending  on  /Xq  and  /x'.  Let  r]  e  he  r]  —  (r}\,  . . . ,  rj^J, 
then  we  define  the  norms  in  V^’  and  as  follows: 

WriWvy  =  II'7||l2(£X)  for  r]  e 


N,  2 

^  for  =  (?7i, . . . ,  e  y^^, 

I'''"' 

where  \  Tm  \  is  the  area  of  . 

Assume  that  /Xq  e  y^“,  /x^  €  y^'.  Let  <t>2  (^)  be  the  value  of  thy  at  the  Lth  node  point, 
G(k  :  j)  be  the  value  of  Green’s  function  at  the  ^h  node  point  due  to  the  yth  point  source, 
and  A:(m)  be  the  value  of  k  at  the  mth  element  T^.  By  discretizing  (2.3a),  we  obtain  the  value 
of  the  Jacobian  of  L,  j  at  the  with  element  as  follows: 

9  r  ■  ■  1 

=  -T^iTTN  E  :  k)[-3K(m)^E„(k,  1)  +  F„(k,  l)]<f>j(l),  (2.1a) 

d/ia  2a^j(i) 


9/x' 

where 


E,n(k,l)  —  J  VukVui, 

(2.8a) 

Em(k,l)^  j  UkUi. 

(2.%b) 

We  note  that  in  most  practical  applications,  computing  G(i  :  k)  for  each  point  source 
located  at  the  Lth  node  and  evaluating  at  the  ;th  detector  location  is  not  feasible.  Instead,  we 
consider  the  adjoint  problem  associated  with  (2.1)  [1],  with  the  adjoint  point  source  located  at 
the  ;th  detector  position.  Then,  the  solution  G*(k  :  i)  to  the  adjoint  problem  at  the  Lth  node 
for  the  adjoint  source  at  the  /th  detector  location  satisfies  G*(k  :  i)  =  G(i  :  k). 


1 


(m)  =  E  ■  k)[-3K(m)^E,„(k,l)]i>j(l), 


(2.1b) 


kj=l 
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2.3.  DOT  as  a  nonlinear  ill-posed  optimization  problem  and  the  trust  region  method 


Although  the  unique  determination  of  pua  and  from  the  measurement  F  at  the  boundary  is 
possible  for  infinite  sources  and  detectors  when  cu  ^  0  and  k  is  known  near  the  boundary  [1], 
the  unique  solvability  of  inverse  DOT  problem  for  finite  sources  and  detectors  is  not  known.  In 
addition,  when  the  number  of  measurements  is  less  than  the  total  number  of  unknowns  in  the 
discretized  inverse  problem,  the  underdetermined  problem  challenges  the  determination  of  the 
optical  coefficients.  However,  even  in  the  overdetermined  case,  the  inevitable  ill  posedness  of 
the  inverse  problem  may  result  in  large  perturbations  in  the  reconstructed  optical  coefficients 
due  to  small  amount  of  noise  in  the  measurements.  In  order  to  address  the  ill  posedness,  we 
use  general  Tikhonov-type  regularization  and  consider  the  following  nonlinear  minimization 
problem: 


{pa,d's)=  argmin  Fir],v), 

where 

j  Ns  Nj 

Fir],  v)  =  1^)  -  Mijf  +aA(r^,  v).  (2.10) 

^  7=1  i=l 

In  (2.10),  Mij  denotes  the  measurement  at  the  ith  detector  due  to  the  jth  source,  A  is  a 
non-negative  operator  from  V^°  x  into  non-negative  real  numbers  U  {0},  and  a  >  0 
is  the  regularization  parameter.  For  example,  A  can  be  chosen  as  follows: 

Aip,  V)  =  \\Ldh  -  +  ||i2(v  -  v*)f  ,,  (2.11) 

where  (rj*,  u*)  e  V^“  x  is  a  given  prior  for  the  optical  coefficients,  and  Li  and  L2  are 

Ng  X  Ne  matrices.  Note  that  depending  on  the  choice  of  Li  and  L2>  (2-1 1)  can  be  either  zeroth- 

or  first-order  Tikhonov  regularizer.  For  other  regularization  methods,  see  [19]. 

Note  that  (2.10)  can  be  extended  in  a  straightforward  manner  to  include  multi-frequency 
measurements.  However,  to  simplify  our  notation,  we  will  present  the  rest  of  our  development 
for  the  single  frequency  measurements. 

The  optimization  problem  in  (2.9)  is  composed  of  two  steps;  the  step  to  determine 
the  minimizing  direction  at  the  current  coefficients  (/x^,  Mj);  the  step  to  perform  line 
search  on  those  minimizing  directions.  In  the  Newtonian  method,  the  minimizing  direction  is 
—  {F")~^F',  where  F'  and  F"  are  the  Jacobian  and  the  Hessian  of  F  with  respect  to  (/Xa,  p'J, 
respectively. 

In  this  paper,  we  adopted  the  trust  region  method  [12],  one  of  the  Newtonian  approaches, 
which  updates  (/Xa,  /x')  in  (2.9)  iteratively  as 

(/Xa,  /x')  ^  (/Xa,  /x')  +  (Spa, 


by  solving  the  following  quadratic  minimization  problem  formulated  at  the  previous  update 

(S/Xa,  3/x^)  =  argmin  [\x‘ F" { Pa,  p',)x  +  x‘ F' {pa,  M*)}  - 

x^K 

subject  toll Dx II  ^  T.  (2.12) 

In  (2.12),  £>  is  a  scaling  matrix,  T  is  a  trust  region  parameter  and  A"  is  a  subspace  of  x  . 
The  scaling  matrix  D  is  used  to  handle  the  constraints  for  the  minimization.  To  stabilize  the 
minimization,  we  control  the  trust  region  parameter  T,  similar  to  the  way  the  parameter  X  in 
the  Levenberg-Marquardt  method  is  controlled  [7,  24,  25].  To  avoid  extensive  computation. 
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the  subspace  K  is  chosen  as  the  two-dimensional  subspace  composed  of  the  gradient  direction 
and  the  approximate  Newton  direction.  This  makes  the  trust  region  method  suitable  for 
large-scale  constrained  optimization  problems  like  the  one  in  (2.9). 

Note  that  F'  and  F"  for  a  given  (/x^,  /x')  in  (2.12)  are  given  as  follows: 

F'^fb  +  aK',  (2.13a) 

F"  ^  J'J  +  Hb  +  aA",  {l.Ub) 

where  b  —  Vi  j{jjLa,  l^'s)  —  -Mij,  J  and  H  are  the  Jacobian  and  the  Hessian  of  the  operator 
respectively.  /  is  given  by  /  =  i  =  \,...,Nd,j  = 

1, . . . ,  Ns,  m  —  1, . . . ,  Nc  as  in  {1.1a).  For  the  computation  of  FI,  see  [21]. 

Taking  the  vanishing  gradient  point  x  =  (5/Xa,  5/x')  G  x  of  the  quadratic  form 
ix'F"(/Xa,  fJi'^x  +  x'F'iiia,  fx's)  in  (2.12),  we  obtain 

F"(/x„,  iJ.'s)(Siia,  Six's)  =  -F'ilXa,  ix's).  (2.14) 

Using  (2.13a),  we  get 

[J'ilXa,  ix's)J(iXa,  l^'s)  +  Hijia,  /x' )/?  H- o'A"](5/Xfl,  3/x')  =  -[/(/Xfl,  /x' )'1j  H- a  A'].  (2.15) 

Thus,  if  (5/Xa  and  (5/x'  are  sufficiently  small,  the  trust  region  method  can  be  used  to  solve 
(2.12)  at  each  iteration  without  considering  the  scaling  matrix  D. 

We  discuss  in  section  3.2  how  we  apply  the  two-level  multiplicative  DD  method  to  solve 
the  optimization  problem  in  (2.9)  and  (2. 10)  by  the  trust  region  method. 

3.  Two-level  domain  decomposition  methods  for  diffuse  optical  tomography 

In  this  section,  we  describe  the  two-level  domain  decomposition  methods  considered  in 
this  paper,  as  applied  to  the  forward  and  inverse  problems.  For  simplicity  and  notational 
clarity,  we  will  describe  our  notation  and  approach  for  the  two-dimensional  optical  domain 
^2  =  [a,  6]  X  [c,  d]  C  and  bilinear  hnite  elements.  Its  extension  to  the  three-dimensional 
domain  is  straightforward,  by  appropriate  definition  of  operators  and  function  spaces. 

Let  {i''  denote  the  domain  ^2  that  is  uniformly  divided  by  Nx  times  in  the  x-axis  direction 
and  Ny  times  in  the  y-axis  direction.  Thus,  ^2*  has  A„  =  (Nx  -i-  1)  x  (Ny  +  1)  nodes  and 
Ng  —  Nx  y.  Ny  elements.  We  shall  call  ^2^  with  (Nxjl  -i- 1)  x  (Nyjl  +  1)  nodes  and  Nxjl  x 
Nyjl  elements,  the  coarse  level  of  ^2^,  assuming  Nx  and  Ny  are  even. 

Let  ^2*  be  decomposed  into  disjoint  union  of  d  sub-domains  Q,i,l  —  \, ...  ,d  such  that 

d 

^2'’  =  ljf2,.  (3.16) 

i=\ 

(3.16)  describes  the  non-overlapping  domain  decomposition.  For  the  overlapping  domain 
decomposition,  we  define  f2“,  I  —  \,  ...  ,d',  an  extension  of  ^2/,  recursively  for  all  non¬ 
negative  integers  w  as  follows:  ^2°  =  ^2;,  and  f2“  is  the  union  of  ^2““*  and  its  adjacent 
elements  in  contact  with  the  boundary  of  f2““\  where  w  will  be  called  the  width  of  the 
overlapping  region.  Thus,  the  overlapping  domain  decomposition  is  given  by 

d 

^2'' =  (3.17) 

l=\ 

Figure  1  illustrates  an  overlapping  and  a  non-overlapping  domain  decomposition  for  two  sub- 
domains.  Table  1  lists  the  acronyms  and  explanations  for  the  algorithms  developed  in  this 
paper. 
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Overlapping  domain  decomposition 


a,*" 

-- 

W 

Q,  ^2 

Nonoveriapping  domain  decomposition 


Figure  1.  Overlapping  domain  decomposition  and  nonoverlapping  domain  decomposition. 


Table  1.  The  acronyms  and  explanations  of  the  algorithms. 


non-DDM  non-domain  decomposition  method 

-Trust  region  method  for  the  inverse  solver  using  (2.9)  and  (2.10). 

and  finite  element  method  for  the  forward  solver 

without  using  domain  decomposition  and  multigrid  methods 


MODDM  Multiplicative  overlapping  domain  decomposition  method 

TMODDM  Two-level  multiplicative  overlapping  domain  decomposition  method 
-The  forward  solver  : 

Multiplicative  Schwarz  method  with  two-grid  preconditioner 

MSDM  Multiplicative  space  decomposition  method 

TMSDM  Two-level  multiplicative  space  decomposition  method 

-The  inverse  solver  : 

Multiplicative  Schwarz  method  with  coarse-level  initialization 
using  (2.9)  and  (2.10). 


3.1.  Two-level  multiplicative  overlapping  domain  decomposition  method  for  the  forward 
problem 

The  forward  problem  is  defined  by  the  boundary  value  problem  (2.1),  which  is  approximated 
by  the  finite  element  formulation  (2.5).  In  this  work,  we  apply  a  two-level  multiplicative 
overlapping  domain  decomposition  method  (TMODDM)  to  solve  (2.5)  in  an  attempt  to 
reduce  the  computational  complexity  of  the  forward  problem.  In  this  context,  the  multi-level 
structure  in  TMODDM  functions  to  speed  up  the  convergence,  while  DD  lets  us  formulate  the 
forward  problem  as  a  sequence  of  smaller-sized  problems,  thereby  reducing  the  computational 
requirements.  Each  sweep  of  TMODDM  involves  two  steps:  the  coarse-level  correction  and 
the  sub-domain  correction.  In  the  following  discussion,  we  describe  these  steps  in  detail. 

We  list  the  notation  and  explanation  for  the  variables  used  in  Algorithm  TMODDM  in 
table  2. 

3.1.1 .  The  coarse-level  correction:  By  formulating  a  smaller-sized  problem  on  the  coarse 
grid  the  coarse-level  correction  step  provides  an  approximation  to  the  error  in  the  optical 
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Table  2.  The  notations  used  in  Algorithm  TMODDM. 

Initial  guess  for  the  photon  density  for  the  jth  source 

^  The  update  of  by  coarse-level  correction 

at  the  nth  sweep  of  TMODDM 

<i>j  ^  The  update  of  ^  ^  after  p  successive  sub-domain 

corrections  on  at  the  nth  sweep  of  TMODDM 


density  estimate,  which  in  turn  is  used  to  update  the  optical  density  on  .  The  coarse-level 
correction  step  can  be  explained  in  detail  as  follows; 

Let  <t>"  denote  the  current  solution  update  for  the  optical  density  <t>  on  after  the  nth 
sweep  of  TMODDM.  The  coarse-level  correction  in  the  (n  +  l)th  sweep  starts  with  computing 


the  residual  [11]  on  the  fine  grid  : 

(3.18) 

and  restricting  it  onto  the  coarse  grid 

rn  ^  R(rh), 

(3.19) 

where  R  :  ^  ^(v„/2+i)x(v,/2+i)  restriction  operator.  In  this  work,  we  use 

the  full  weighting  restriction  operator  [39],  which  performs  a  nine-point  weighted  averaging. 
An  approximation  to  the  error  in  the  solution  update  <t>"  can  be  obtained  by  solving  the  defect 
equation  [39]  formulated  on 

(lif  +  C+^A)  en^rn.  (3.20) 

Then,  the  solution  of  (3.20)  is  used  to  update  <1>",  which  completes  the  coarse-level  correction: 

^  0"  +  P(eH),  (3.21) 

where  P  :  R(w./2+i)x(iv,/2+i)  ^  ]^(iv,+i)x(7v,+i)  prolongation  operator.  In  this  work,  we 
use  a  bilinear  prolongation  operator  [11]. 

3.1.2.  Sub-domain  correction.  Following  the  coarse  grid  correction  step,  the  sub-domain 
correction  is  performed  on  the  hne  grid  ,  by  employing  a  multiplicative  overlapping  domain 
decomposition  method  (MODDM).  In  this  step,  the  boundary  value  problem  (2.1)  is  redefined 
on  each  of  the  overlapping  sub-domains  with  appropriate  boundary  conditions.  Then,  the 
solution  of  the  boundary  value  problem  formulated  on  each  sub-domain  is  used  to  update  the 
optical  density.  In  the  following,  we  present  the  details  of  the  sub-domain  correction. 

Let  be  the  current  optical  density  estimate,  obtained  by  updating  with  the 

solutions  of  the  boundary  value  problems  formulated  on  the  sub-domains  ^2“ ,  . . . , 

Then,  the  boundary  value  problem  on  the  sub-domain  is  formulated  as 

—  V  ■  (kVu)  H- H- — jv  —  q  in  ^2^  (3.22a) 

v-\-2aK—^0  on  3^2“  0  3^2,  (3.22h) 

3v  ^ 


on 


3^2^\3^2. 


(3.22c) 
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Then,  the  solution  of  (3.22a)  is  used  to  obtain  the  new  optical  density  estimate  on  as  follows: 


O 


^rf+1 


V 

(t>"+3?I 


in 

in 


(3.23) 


Recursive  application  of  this  procedure  for  the  rest  of  the  sub-domains  Q.f  :  I  —  p  +  I,  p  + 
2, . . .  ,d  completes  the  sub-domain  correction  step  and  results  in  the  optical  density  update 
for  the  next  TMODDM  sweep.  We  refer  to  the  pseudo-code  in  algorithm  1  for  a  summary 
of  the  coarse-level  correction  and  sub-domain  correction  steps  in  TMODDM. 


3.2.  Two-level  multiplicative  overlapping  space  decomposition  method  for  the  inverse 
problem 

In  this  work,  we  apply  a  two-level  multiplicative  overlapping  space  decomposition  method 
(TMSDM)  to  solve  the  inverse  problem  in  (2.9).  We  use  the  term  ‘space  decomposition’  to 
distinguish  the  DD  method  developed  for  the  optimization  problem  (2.9)  from  TMODDM 
described  in  section  3.1.  Although  both  methods  involve  domain  decomposition;  in  the 
forward  problem,  the  DD  method  is  applied  on  a  partial  differential  equation,  whereas  in  the 
inverse  problem,  it  is  applied  to  an  optimization  problem. 

In  the  forward  problem,  we  obtain  the  value  of  the  photon  density  at  each  node,  whereas 
in  the  inverse  problem,  we  are  interested  in  the  constant  value  of  the  optical  coefficients  on 
each  element.  Note  that  this  is  a  direct  consequence  of  the  different  discretization  schemes 
followed  in  the  forward  and  inverse  problems.  Even  though  TMSDM  can  be  applied  for  any 
overlapping  domain  decomposition,  in  this  work,  we  use  the  same  domain  decomposition  in 
both  forward  and  inverse  problems  to  avoid  introducing  new  notation. 

Algorithm  1  TMODDM 

Ns :  The  number  of  sources. 

Nd '.  The  number  of  detectors. 

Mp\  The  maximum  number  of  sub-domain  correction  sweeps. 

ior  j  —  Ns  +  Nd  do 
Initialize  <I>°. 
for  n  =  1, . . . ,  Mp 

Start  coarse-level  correction 

rl  ^  (a  H-  C  H-  ~  /'^>  {Compute  the  residual} 

rjj  R  (rl )  Restrict  the  residual 

e^fj  (a  -H  C  -I-  2“  {Compute  the  error  on 

-H  P(e^fj)  {Update  the  photon  density} 
end  coarse-level  correction 

for  /  =  1, . . . ,  d  do 

«— 1+  — 

Update  <i>j  at  by  (3.22)  and  (3.23)  {Sub-domain  correction} 

end  for 
end  for 

end  for 

Compute  Ti  j  and  '^^(m),  ?^(m),  m  =  I, . . . ,  Ne  using  (2.4)  and  (2.7). 

{Post-processing} 
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Table  3.  The  notations  used  in  Algorithm  TMSDM. 

Initial  guess  for  the  optical  coefficients  /r  =  (fia,  /r') 
The  update  of  /t""*  by  the  coarse-level  inverse  solver 
at  the  nth  sweep  of  TMSDM 

y!r"~'''‘3+r  The  update  of after p  successive  sub-domain 
corrections  on  at  the  nth  sweep  of  TMSDM 


TMSDM  consists  of  two  steps:  Coarse-level  initiation  and  sub-domain  correction.  The 
coarse-level  initiation  step  provides  an  initial  guess  for  the  fine  level,  hy  solving  (2.9)  on  the 
coarse  level.  The  sub-domain  correction  involves  the  minimization  of  (2.9)  on  the  fine  level, 
iteratively  on  each  sub-domain  using  a  multiplicative  space  decomposition  method  (MSDM). 
In  the  following,  we  present  the  details  of  coarse-level  initiation  and  sub-domain  correction. 

We  list  the  notation  and  explanation  for  the  variables  used  in  aslgorithm  TMSDM  in 
table  3. 

3.2.1.  Coarse-level  initiation:  By  solving  a  smaller  sized  minimization  problem  formulated 
on  the  coarse  grid  ,  the  coarse-level  initiation  provides  a  computationally  viable 
approximation  for  the  optical  coefficient  estimates.  This  approximation  is  then  used  as  an 
initial  guess  at  the  first  sweep  of  the  sub-domain  correction  step.  In  theorem  1,  we  show  that 
the  local  convergence  of  the  sub-domain  correction  step  under  the  condition  that  the  initial 
guess  is  sufficiently  close  to  the  solution.  Therefore,  the  coarse-level  initiation  is  motivated  to 
achieve  the  convergence  of  the  sub-domain  correction.  The  coarse-level  initiation  step  can  be 
described  as  follows: 

Let  (/Xfl,  /x')°  denote  the  initial  guess  for  the  optical  coefficients  on  for  the  optical 
coefficients  (/Tq,  /r^).  The  coarse-level  initiation  starts  with  downsampling  (/Xa,  /x')°  onto  the 
coarse  grid 

{Uia,lJi'fH  (3.24) 

where  the  downsampling  operator  D  :  is  defined  by 

D(h)(mx,  nty)  —  ^[h(2mx  —  1,  2my  —  1)  +  h(2mx  —  1,  2my) 

+  h{2mx,  2my  —  1)  +  h(2mx,  2my)],  (3.25) 

for  I  ^  Mx  ^  Nx/2,  I  ^  My  ^  Ny/2,  h  e  where  Ne/A-  =  Nx/2  x  Ny/2  is  the  number 
of  elements  on  the  coarse  level.  Then,  we  formulate  the  inverse  problem  on  the  coarse  grid 
with  the  initial  guess  (/Xo,  /x')5^  and  seek  a  solution  to  the  following  minimization  problem: 

1 

=  argmin  Fh{ii,v), 

where  ^4  =  {??  e  is  constant  at  each  Tm,m  —  1, . . . ,  A^c/4}  for  y  —  iJia,  tis  and  Fh  '■ 
^n“/4  X  ^n‘/4  ^  R  is  given  by 

j  N,  Nj 

Fniil,  v)  =  2  -  Mijf  -\-aAHiri,  v), 

^  j=l  i=l 

and  Afj  :  V^“i^  x  ^  UfOl-  Then,  the  initial  guess  (/x^,  /x^)^  for  the  sub-domain 
correction  step  is  obtained  by  upsampling  (/x^,  to  the  fine  grid 

I^'sYh), 


(Fa, 


(3.27) 
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where  the  upsampling  operator  C/  :  ^  K^'‘’^^Mn(3.27)isthebilinearprolongation 

operator. 


3.2.2.  Sub-domain  correction:  Following  the  coarse  grid  initiation  step,  the  sub-domain 
correction  is  performed  on  the  hne  grid  Qf' .  In  this  step,  the  inverse  problem  (2.9)  is  solved  by 
successively  minimizing  the  objective  functional  in  (2.9)  with  respect  to  the  unknown  optical 
coefficients  on  each  sub-domain.  Thus,  the  optimization  problem  (2.9)  is  decomposed  into  a 
sequence  of  smaller  minimization  problems,  using  MSDM. 

Before  we  give  the  details  of  the  sub-domain  correction  step,  we  hrst  introduce  some 
notation.  Recall  that  {r\,v)  e  x  .  Then,  for  each  /  =  1,  . . . ,  we  can  decompose 
(jj,  v)  into  orthogonal  components  as  follows: 


(r],  v)i 

and 

(r],  v)c 


(ri,  i') 

in 

— w 

(0,  0) 

in 

(0,  0) 

in 

Q'f 

— W 

(v,  f) 

in 

Q.\Q,  , 

(3.28) 


(3.29) 


such  that  (rj,  v)  —  (rj,  v)i  -i-  (77,  y);/. 

The  first  sub-domain  correction  sweep  is  initiated  by  the  update  (/Xa,  /r')5n  provided  by 
the  coarse-level  initiation.  Let  (/Xa,  /x')"'^3fi  be  the  estimate  for  the  optical  coefficients  in  the 
(«  +  l)th  sub-domain  correction  sweep,  obtained  by  updating  (/Xq,  /U.^)"'^3^  with  the  solution 
estimates  obtained  on  the  sub-domains  ^2“ , . . . ,  Then,  the  inverse  problem  on  the 

sub-domain  ^2^  reads 


(/Xa,  il's)  =  argmin  f  ((77,  v)p  +  (77,  u)^/). 


(3.30) 


subject  to 

(r],v)p,  =  (/Xa,/x')p, 

where  F(ri,  v)  is  given  by  (2.10).  Thus,  the  minimization  is  performed  over  only  (77,  y)  on 
Qp.  Then,  the  solution  of  (3.30)  is  used  to  obtain  the  new  optical  coefficients  update  on  ^2  as 
follows: 


£jT 

d+\  = 


in  ^2^ 

^(/Xa,/x;)7^  in  ^2\^;. 

Successive  application  of  this  procedure  for  the  rest  of  the  sub-domains  f2" 


(3.31) 


I  = 


p  +  p  +  2, . . . ,  d  completes  the  {n  l)th  sweep  of  the  sub-domain  correction  step  and 
results  in  the  optical  coefficient  update  (/x^,  /x^)""^'  for  the  next  sub-domain  correction  sweep. 
We  refer  to  the  pseudo-code  in  algorithm  2  for  a  summary  of  the  coarse-level  initiation  and 
sub-domain  correction  steps  in  TMSDM. 


3.3.  Convergence  ofTMODDM  and  TMSDM 

Assume  that  the  mesh  size  of  the  hnite  element  formulation  is  0(h)  and  that  the  sub-domains 
are  of  diameter  0(H)  and  the  width  of  the  overlapping  region  is  0(SH),  where  0^5  <  1. 
Then  the  following  convergence  behaviour  is  known  for  algorithm  TMODDM  (see  section 
2.5  in  [33]). 

(a)  Convergence  is  poor  if  5  =  0,  but  improves  rapidly  as  S  increases. 
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(b)  If  S  is  fixed,  the  number  of  sub-domain  correction  sweeps  required  for  convergence  is 
bounded  independent  of  h,  H  and  H / h. 

(c)  The  number  of  sub-domain  correction  sweeps  required  for  convergence  is  roughly  half  of 
that  needed  for  the  additive  Schwarz  method. 


Algorithm  2  TMSDM 

d:  The  number  of  sub-domains. 

Mu'.  The  maximum  number  of  sub-domain  correction  sweeps. 
Initialize  /x®  on  O!' . 

Start  coarse-level  initiation 
^  DiifXa,  ft')") 

Solve  ilia,  =  argmin  Fh{ii,  v) 

^Ne/4^^Ne/‘i 

(/x„ /x;)"^®  ^[/((/x„,/x;)"„^®) 
end  coarse-level  initiation 
for  n  —  1 , . . . ,  M^  do 
for  I  =  1 , . . . ,  do 

(/X.,  K)  =  argmin^^^^^^^,„,,^^,;,,  Fi>^,  v) 

subject  to  (r],  v)/,  =  (fia, 

Update  the  optical  coefficients  by  (3.31) 

end  for 
end  for 


Therefore,  in  terms  of  convergence,  multiplicative  Schwarz  algorithms  are  more  advantageous 
as  compared  to  additive  Schwarz  algorithms.  This  follows  from  the  fact  that  the  multiplicative 
Schwarz  algorithms  take  the  advantage  of  immediate  use  of  the  solution  update  in  the 
successive  neighbouring  sub-problem.  On  the  other  hand,  in  additive  Schwarz  algorithms, 
one  computes  the  solutions  to  the  localized  sub-problems  independent  of  each  other  and  uses 
the  solution  updates  in  the  following  sub-domain  correction  sweep.  We  refer  to  [31]  and  [33] 
for  a  detailed  discussion  of  the  Schwarz  algorithms. 

The  local  linear  convergence  of  Algorithm  MSDM  is  shown  below,  using  the  results 
in  [5]. 


Theorem  1.  Let  A  he  the  regularization  operator  given  in  (2.11),  and  A'  and  A"  denote  the 
first-  and  second-order  Frechet  derivatives  of  A.  Assume  that  there  is  a  positive  constant  Ca 
such  that  y‘ A" y  ^  CAy'y  for  all  y  e  x 

Let  (fia,  he  the  nth  step  MSDM  approximation  of(fia,  /x^).  Assume  that 


Il(/Xfl,/X^)  (/Xa,/X^)^| 


X 


< 


aCA 

ChCj 


{q  —  0, . . .  ,n). 


Then, 


where  Ca  — 


||(/Xa,/X;)"-(M«,M;[ 


/8\"  oi^Ca 

'<rxv;;^(9j  ^[CaCx  +  CaCh], 


(3.32) 


(3.33) 


II -^^11  1‘a  /s  and  C j ,  C H  are  positive  constants  such  that 

^Ne 

II /x^)|l  ^  ^  Cj, 

''Ne  Ne  ‘ 


(334a) 
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*  ^  M 


^  c. 


i334b) 


Proof.  See  the  appendix  for  the  proof  of  theorem  1 . 

Note  that  C j  and  Ch  depend  on  the  frequency  u>.  For  the  multi-frequency  measurement 
case  with  Na,  frequencies  coi, ,  coff^,  let 


Cj  —  max  C^‘ , 


Ch  =  max  CS, 


(3.35) 


where  C“‘  and  C^‘  are  the  frequency-dependent  norm  bounds.  Then,  theorem  1  can  be 
extended  to  multi-frequency  measurement  case  in  a  straightforward  manner,  by  replacing  the 
norm  bounds  in  (3.34a)  and  (3.34£i)  by  the  bounds  given  in  (3.35). 

For  the  general  Tikhonov  regularizer,  A  is  chosen  as 

A(?7,  u)  =  ||Li(»?  -  +  \\L2(v  - 

Therefore,  if  Li  and  L2  are  positive  definite,  the  assumption  of  y‘  A" y  ^  C^y'y  in  theorem  1 
holds  since 


(?7,  vY A"(r],  v)  =  ri‘Liri  +  v‘ L2V. 

Clearly,  this  assumption  holds  for  the  zeroth-order  Tikhonov  regularization.  For  the 
appropriate  choice  of  the  discrete  approximation  of  the  differential  operator,  first-order 
Tikhonov  regularizer  also  satisfies  the  assumption  in  theorem  1. 

Note  that  in  [21],  we  have  shown  that  the  first-  and  second-order  Frechet  derivatives  of  the 
coefficient-to-measurement  operator  with  respect  to  (/Tq  ,  /r' )  exist  and  are  bounded.  Similarly, 
J  and  H,  being  finite  approximation  to  the  first-  and  second-order  Frechet  derivatives,  satisfy 
(3.34a)  and  (3.34h).  □ 


3.4.  Computational  cost  of  the  algorithms 

In  this  subsection,  the  computational  cost  of  the  proposed  algorithms  is  analysed.  We  will  call, 
the  method  solving  (2.5)  and  (2.15)  without  any  domain  decomposition,  the  non-DD  method 
and  compare  it  with  the  proposed  methods. 

The  system  matrices  for  each  equation  are  2N„  x  2N„  and  2Ne  x  2Ne,  respectively. 
Assume  that  we  require  0[N^)  and  0[Ne)  (I  ^  q  ^  3)  floating  point  operations  for  each 
equation  with  the  non-DD  method.  Note  that  for  a  full  nonzero  matrix,  q  —  3  and  for  a 
diagonal  matrix,  q  —  \.  Suppose  that  we  use  d  sub-domains  with  equal  nodes  and  elements 
for  the  proposed  algorithms. 

The  coarse-level  computation  needs  4“^  times  (in  two  dimensions)  or  8“^  times  (in  three 
dimensions)  the  computations  required  on  the  fine  level  computation  for  the  forward  and 
inverse  solvers.  Thus,  if  we  neglect  the  coarse-level  computation,  the  computational  cost  of 
algorithm  MODDM  is  (for  one  computer)  and  Mpd~‘‘  (for  d  parallel  computers) 

times  that  of  the  non-DD  method,  where  Mp  is  the  maximum  number  of  sub-domain 
correction  sweeps.  Recall  that  Mp  is  the  maximum  sweep  of  all  sub-domain  corrections 
followed  by  the  coarse  grid  correction.  It  is  well  known  that  for  a  given  Mp,  the  convergence 
behaviour  of  TMODDM  is  independent  of  N„,  d  and  Nn/d  (see  (a)-(c)  below  subsection  3.3). 
Furthermore  TMODDM  has  good  convergence  behaviour  and  it  has  better  convergence 
behaviour  for  smaller  Mp  as  compared  to  MODDM.  In  this  work,  we  have  chosen  Mp  =  3 
when  TMODDM  is  used  in  the  inverse  solver  in  section  4.2.  This  analysis  shows  that 
by  using  multiple  sub-domains,  algorithm  TMODDM  can  achieve  significant  reduction  in 
computational  requirements. 
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Table  4.  The  comparison  of  the  computational  costs  for  the  non-DD  method,  TMSDM  on  one, 
d,  and  computers,  q  is  a  constant  between  1  and  3  depending  on  the  sparsity  of  the  system 
matrix.  In  the  table,  N„  is  the  number  of  nodes,  Ne  is  the  number  of  elements,  d  is  the  number  of 
sub-domains,  Mp  is  the  number  of  sweeps  for  the  forward  solver  and  My  is  the  number  of  sweeps 
for  the  inverse  solver. 


Method 

Computational  cost 

Forward  solver 

Inverse  solver 

non-DDM 

0{Nh 

0{Nl) 

TMSDM  on  1  computer 

Mpd^-'iOiK) 

Myd^~iO(Nl) 

TMSDM  on  d  computers 

Mpd^-'iOiK) 

Myd^iQiNl) 

TMSDM  on  computers 

Mpd-iOiN?,) 

Myd-‘iO(N^) 

The  computational  cost  of  algorithm  TMSDM  with  algorithm  TMODDM  as  the  forward 
solver,  and  the  computational  cost  of  the  non-DD  method  are  tabulated  in  table  4,  using  1, 
d  and  computers.  When  d  parallel  computers  are  used,  the  parallel  computing  is  applied 
to  the  inverse  problem  only,  whereas  when  d^  computers  are  used,  the  parallel  computing 
is  applied  to  both  the  inverse  and  forward  problems.  Note  that  this  comparison  for  the 
inverse  solver  is  based  on  the  assumption  that  the  speed  of  data  communication  between 
parallel  computers  is  sufficiently  fast.  Parallel  computation  is  not  treated  in  this  paper,  but 
the  proposed  algorithms  combined  with  the  parallel  computing  is  expected  to  provide  more 
efficient  results. 

4.  Numerical  simulations 

In  this  section,  we  demonstrate  the  performance  of  the  algorithms  TMODDM  and  TMSDM 
using  simulated  optical  data.  In  section  4.1,  we  conduct  a  series  of  experiments  to  test  and 
compare  the  performance  of  TMODDM  to  that  of  the  one-level  DD  method.  In  section  4.2, 
we  test  TMSDM  in  a  number  of  experiments,  where  we  consider  /r-a-only,  /x'-only  and 
simultaneous  /Xq  and  reconstructions.  For  the  summary  of  method  descriptions,  we  refer 
to  table  1 . 

In  our  experiments,  we  set  the  angular  frequency  co  —  2jt  x  100  MHz,  and  a  —  1,  except 
for  the  simultaneous  absorption  and  reduced  scattering  imaging  case,  where  we  use  two 
frequencies:  coi  =  300  MHz  and  a>2  =  500  MHz.  Note  that  for  the  /Xa-only  simulation,  zero- 
mean  Gaussian  noise  with  standard  deviation  equal  to  1  %  of  the  average  of  the  measurements 
was  added  to  the  measurements.  Similarly,  for  the  /x^-only  case,  we  added  zero-mean  Gaussian 
noise  with  standard  deviation  equal  to  0.5%  of  the  average  of  the  measurements.  For  the 
simultaneous  /x^  and  /x^  imaging,  we  considered  additive  zero-mean  Gaussian  noise  with 
standard  deviation  equal  to  5%  of  the  average  of  the  measurements.  For  the  inverse  problem 
formulation,  we  chose  the  following  zeroth-order  Tikhonov  regularizer  with  the  regularization 
parameter  O'  =  10“^: 

Mri,  v)  =  \\rify,a  +  (4.36) 

for  7]  e  and  v  e 
4.1 .  Algorithm  TMODDM 

In  this  experiment,  we  evaluate  the  performance  of  TMODDM,  for  an  optical  medium 
whose  absorption  and  scattering  coefficient  distributions  are  shown  in  figures  2(a)  and  (b), 


16 


K  Kwon  et  al 


Figure  2.  (a)  The  original  fia  image.  The  white  region  represents  background  tissue  with  fia  value 
of  0.05  cm“^,  and  the  black  region  represents  an  anomaly  with  fia  value  of  0.2  cm“^.  (b)  The 
original  image.  The  white  region  represents  background  tissue  with  /x'  value  of  8  cm~^,  and 
the  black  region  represents  an  anomaly  with  /x'  value  of  1 6  cm~  ^ .  (c)  2  x  2  domain  decomposition 
of  =  [0,  6]  X  [0,  6].  (d)  The  LINPACK  solution  of  the  frequency-domain  diffusion  equation  for 
a  point  source  located  at  (6,  3).  (e)  Relative  L2  error  versus  number  of  floating  point  operations 
for  one  and  two-level  MODDM.  (f)  Relative  L2  error  versus  number  of  floating  point  operations 
for  TMODDM  with  domain  decompositions  with  the  overlap  region  width  w  =  1,2  and  3  pixels. 
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respectively.  The  black  region  in  figure  2(a)  shows  an  anomaly  with  absorption  coefficient 
fXa  —  0.2  cm~',  where  the  background  has  an  absorption  coefficient  fia  —  0.05  cm~'. 
Figure  2(b)  shows  the  reduced  scattering  coefficient  of  the  same  medium,  where  the  black 
region  and  the  background  have  reduced  scattering  coefficients  /x'  =  16  and  /x'  =  8  cm~', 
respectively.  We  divide  the  square  domain  ^2  =  [0,  6]  x  [0,  6]  cm^  into  32  x  32  uniform 
pixels.  Next,  we  decompose  the  domain  into  four  overlapping  sub-domains  with  2/3  cm  of 
overlap  region,  as  shown  in  figure  2(c). 

We  consider  fhe  solution  of  the  diffusion  equation  (2.1)  for  a  point  source  located  at  (6,  3) 
and  compare  the  performance  of  TMODDM  and  the  one-level  DD  method  (MODDM)  using 
the  following  relative  L^(f2)  error: 


11^^  IIl2(Q) 


(4.37) 


where  tt>^  is  the  LINPACK  solution  of  (2.5),  and  <t>"^  is  either  the  TMODDM  or  MODDM 
solution.  Figure  2(d)  shows  the  LINPACK  solution  for  the  point  source  located  at  (6,  3). 
Figure  2(e)  shows  the  relative  L^(f2)  error  versus  the  number  of  floating  point  operations  for 
TMODDM  and  MODDM.  We  observe  that  TMODDM  converges  faster  than  the  one-level 
MODDM,  with  a  smaller  number  of  floating  point  operations. 

Next,  to  show  the  effect  of  overlap  width  on  the  performance  of  TMODDM,  we  consider 
three  cases  where  the  overlap  region  consists  of  1,  2  and  3  pixels,  respectively.  Figure  2(f) 
shows  the  relative  L^(Q.)  error  versus  the  number  of  floating  point  operations  for  each  overlap 
width.  We  see  that  the  convergence  improves  as  the  width  gets  larger.  This  verihes  the 
convergence  result  stated  in  (a)  below  subsection  3.3. 


4.2.  Algorithm  TMSDM 


In  this  section,  we  evaluate  the  performance  of  TMSDM  in  two  sets  of  experiments  using 
simulated  data.  In  the  first  experiment,  we  consider  /x^  -only  and  /x^  -only  reconstructions  and 
compare  TMSDM  to  the  non-DD  method.  In  the  second  experiment,  we  consider  simultaneous 
reconstruction  of  /x^,  and  /x'  using  TMSDM  and  compare  the  performance  to  that  of  the  non-DD 
method.  For  the  description  of  the  non-DD  method,  we  refer  to  table  1 . 

For  the  forward  solver,  we  use  TMODDM  with  the  maximum  number  of  sub-domain 
correction  sweeps  set  to  3.  In  order  to  evaluate  the  performance  of  the  inverse  solvers 
and  compare  TMSDM  to  the  non-DD  method,  we  define  the  following  signal-to-noise-ratio 
(SNR): 


SNR  =  201ogio 


lly|lL^(S2) 

-  yllL2(f2)’ 


(4.38) 


where  y'^  is  the  actual  optical  coefficient  (either  fia  or  /x')  and  y  denotes  the  reconstructed 
optical  coefficients,  obtained  by  using  either  TMSDM  or  the  non-DD  method. 


4.2.1.  Experiment  1.  In  this  experiment,  we  consider  two  cases.  In  the  first  one,  we 
assume  that  the  reduced  scattering  coefficient  of  the  medium  is  known  and  set  it  to  8  cm“*, 
and  we  reconstruct  the  absorption  coefficient  of  the  medium.  Figure  3(a)  shows  the  circular 
heterogeneity  with  /x^  =  0.20  cm“^  embedded  in  a  background  with  /x^  =  0.05  cm~'. 

In  the  second  case,  we  assume  that  the  absorption  coefficient  of  the  medium  is  known  and 
is  set  to  fXa  —  0.05  cm“'  and  we  reconstruct  the  reduced  scattering  image.  Figure  4(a)  shows 
the  circular  heterogeneity  with  /x'  =  16  cm”'  embedded  in  a  background  with  /x'  =  8  cm”*. 
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(b) 


SNR  vs.  Number  of  flops  (mua) 


SNR(mus)  15.5 

(e) 


SNR(mus)  15.6 


Figure  3.  The  set-up  and  the  results  regarding  the  /Xa-only  reconstruction,  (a)  The  original  jia 
image  and  the  source-detector  configuration.  The  white  region  represents  background  tissue  with 
(la  value  of  0.05  cm~^,  and  the  black  region  represents  an  anomaly  with  (Iq  value  of  0.2  cm~^. 
(b)  The  2x2  domain  decomposition  and  uniform  32  x  32  discretization  of  Q.  (c)  The  value 
of  the  objective  functional  versus  the  number  of  floating  point  operations  of  the  non-DD  method 
and  TMSDM.  (d)  The  SNR  versus  the  number  of  floating  point  operations  of  the  non-DD  method 
and  TMSDM.  (e)  The  absorption  image  reconstruction  by  the  non-DD  method  after  12.893  x  10^ 
number  of  floating  point  operations,  with  SNR  =  8.174.  (f)  The  reconstruction  by  TMSDM  after 
8.561  X  10^  number  of  floating  point  operations  with  SNR  =  9.955. 
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*0*0*0*0^0*0*0*0* 


0  (a) 

* 

0 

0 


*0*0*0*0*0*0*0*0* 


(b) 


Number  of  flops 


SNR  vs.  Number  of  flops  (mus) 


SNR{mus)  15.5 


(f) 


SNR(mus)  15.6 


Figure  4.  The  set-up  and  the  results  regarding  the  /x^-only  reconstruction,  (a)  The  original  /x' 
image  and  the  source-detector  configuration.  The  white  region  represents  background  tissue  with 
//'  value  of  8  cm“^ ,  and  the  black  region  represents  an  anomaly  with  //'  value  of  16  cm““^ .  (b)  The 
2x2  domain  decomposition  and  uniform  32  x  32  discretization  of  Q.  (c)  The  value  of  the  objective 
functional  versus  the  number  of  floating  point  operations  of  the  non-DD  method  and  TMSDM. 

(d)  The  SNR  versus  the  number  of  floating  point  operations  of  the  non-DD  method  and  TMSDM. 

(e)  The  reduced  scattering  image  reconstruction  by  the  non-DD  method  after  13.165  x  10^  number 
of  floating  point  operations,  with  SNR  =  15.5.  (f)  The  reconstruction  by  TMSDM  after  8.373  x  10^ 
number  of  floating  point  operations  with  SNR  =  15.6. 
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SNR  vs.  Number  of  flops  (mua) 


SNR  vs.  Number  of  flops  (mus) 


Figure  5.  The  set-up  and  the  results  regarding  the  simultaneous  fia  and  /x'  reconstruction,  (a)  The 
original  image  and  the  source-detector  configuration.  The  white  region  represents  background 
tissue  with  fia  value  of  0.05  cm“^,  and  the  black  region  represents  an  anomaly  with  value  of 
0.2  cm“^ .  (b)  The  original  image.  The  white  region  represents  background  tissue  with  /x'  value 
of  8  cm~^,  and  the  black  region  represents  an  anomaly  with  value  of  16  cm~^.  (c)  The  2x2 
domain  decomposition  and  uniform  32  x  32  discretization  of  Q.  (d)  The  value  of  the  objective 
functional  versus  the  number  of  floating  point  operations  of  non-DD  method  and  TMSDM.  (e)  The 
SNR  for  the  absorption  coefficient  reconstruction  versus  the  number  of  floating  point  operations  of 
non-DD  method  and  TMSDM.  (f)  The  SNR  for  the  reduced  scattering  coefficient  reconstruction 
versus  the  number  of  floating  point  operations  of  non-DD  method  and  TMSDM. 
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SNR(mua)  9.35  SNR(mus)  13.9 

(b) 


SNR(mua)  8.92  SNR(mus)  14.9 

(d) 


* 


Figure  6.  The  simultaneous  reconstruction  results  of  the  absorption  and  reduced  scattering 
coefficients  shown  in  figures  5(a)— (b).  (a),(b):  the  simultaneous  reconstruction  of  the  absorption 
and  reduced  scattering  coefficients  by  using  the  non-DD  method  after  6.780  x  10***  number 
of  floating  point  operations  with  SNR  (/ia)  =  9.346  and  SNR  (/x')  =  13.935.  (c),(d):  the 
simultaneous  reconstruction  of  the  absorption  and  reduced  scattering  coefficients  by  using  TMSDM 
after  2.459  x  10*°  number  of  floating  point  operations,  with  SNR  (/Xa)  =  8.920  and  SNR 
(/x;)  =  14.899. 


Figures  3(c)-(d)  and  figures  4(c)-(d)  show  the  value  of  the  objective  functional  and  the 
SNR  versus  the  number  of  floating  point  operations.  Figures  3(e),  4(e)  and  figures  3(f),  4(f) 
show  the  reconstructed  images  using  non-DD  and  TMSDM  methods,  respectively. 

Figures  3— 4(c)  show  that  at  a  given  number  of  floating  point  operations,  TMSDM  achieves 
a  lower  objective  functional  value  as  compared  to  the  non-DD  method.  Similarly,  figures  3- 
4(d)  show  that  at  a  given  number  of  floating  point  operations,  TMSDM  achieves  a  higher  SNR 
value. 

For  the  /x,J,-only  imaging,  the  reconstructed  images  shown  in  figures  3(e)-(f)  indicate  that 
TMSDM  provides  qualitatively  and  quantitatively  better  results  as  compared  to  the  non-DD 
method.  Note  that  the  reconstructed  /Xq  image  using  TMSDM  was  obtained  after  8.561  x  10® 
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floating  point  operations,  while  the  image  using  the  non-DD  method  was  obtained  after 
12.893  X  10^  floating  point  operations.  Similarly  flgures  4(e)-(f)  show  the  reconstructed 
/x'  images  obtained  after  13.165  x  10®  and  8.373  x  10®  floating  point  operations,  by  using 
the  non-DD  method  and  TMSDM,  respectively.  Similar  to  the  /Xa-only  imaging  case,  these 
images  indicate  that  TMSDM  outperforms  the  non-DD  method. 

4.2.2.  Experiment  2.  In  this  experiment,  we  test  the  performance  of  TMSDM  in  the 
simultaneous  reconstruction  of  absorption  and  reduced  scattering  images  and  show  the 
convergence  of  the  algorithm  experimentally. 

Figure  5(a)  shows  an  inclusion  with  jXa  =  0.20  cm“'  in  a  background  with  pia  — 
0.05  cm“'.  Figure  5(b)  shows  the  reduced  scattering  coefficient  of  the  same  medium,  where 
the  circular  inclusion  corresponding  to  an  object  with  =  16  cm“'  is  embedded  in  a 
background  with  /x'  =  8  cm“'.  Figure  5(c)  shows  the  domain  decomposition  with  uniform 
discretization  for  the  forward  and  inverse  problems. 

Figure  5(d)  shows  the  value  of  the  objective  functional  versus  the  number  of  floating  point 
operations  for  both  TMSDM  and  the  non-DD  method.  We  see  that  TMSDM  achieves  a  lower 
objective  functional  value  at  a  lower  number  of  floating  point  operations. 

Figures  5(e)  and  (f)  show  the  SNR  value  achieved  by  TMSDM  and  the  non-DD  method 
versus  the  number  of  floating  point  operations,  respectively  for  absorption  and  reduced 
scattering  images.  These  images  indicate  that  TMSDM  outperforms  the  non-DD  method, 
especially  for  the  reduced  scattering  image.  The  reconstructed  images  shown  in  figure  6  are 
consistent  with  the  performance  numbers  given  in  figures  5(d)-(e)  and  (f).  We  note  that  the 
images  reconstructed  by  using  TMSDM  are  obtained  in  2.459  x  10'®  number  of  floating  point 
operations,  while  the  images  obtained  by  using  the  non-DD  method  require  6.780  x  10*® 
number  of  floating  point  operations. 


5.  Conclusion 

In  this  work,  we  developed  two-level  overlapping  domain  decomposition  algorithms  to  address 
the  computational  complexity  of  the  forward  and  inverse  problems  associated  with  DOT 
imaging.  We  used  the  frequency-domain  diffusion  equation  to  model  NIR  light  propagation. 
In  order  to  address  the  ill-posed  nature  of  the  inverse  problem,  we  used  a  nonlinear  least-squares 
formulation  with  a  general  Tikhonov  regularization  term  to  recover  both  the  absorption  and 
scattering  coefficients. 

In  the  forward  problem,  we  employed  an  overlapping  domain  decomposition  algorithm 
with  a  two-grid  preconditioner  (TMODDM),  and  for  the  nonlinear  inverse  problem,  we  used 
an  overlapping  space  decomposition  algorithm  with  a  coarse-level  initiation  (TMSDM).  We 
proved  the  local  convergence  of  the  TMSDM  method  under  the  conditions  that  lead  to  the 
strict  local  convexity  of  the  objective  functional  formulated  for  the  inverse  problem.  For 
notational  brevity,  we  described  both  DD  algorithms  for  a  2D  bounded  optical  domain  and 
uniform  discretization.  Nevertheless,  the  extension  of  the  algorithm  for  the  3D  case  and 
adaptive  discretization  is  straightforward.  This  requires  replacing  the  2D  finite  elements 
with  3D  finite  elements  and  definition  of  3D  restriction  (downsampling)  and  prolongation 
(upsampling)  operators  for  the  two-level  algorithms. 

Note  that  in  our  inverse  problem  solver,  measurements  from  all  source-detector  pairs  are 
used  for  each  sub-domain.  This  accounts  for  the  contribution  of  all  sources  to  the  boundary 
data.  Therefore,  this  scheme  does  not  impose  any  constraints  on  how  the  image  domain  is 
decomposed  into  sub-domains. 
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We  analysed  the  computational  complexity  of  both  algorithms  and  demonstrated  their 
performance  in  three  different  numerical  simulations,  where  we  considered  simultaneous 
absorption  and  reduced  scattering  coefficient  reconstruction  as  well  as  absorption  only  and 
reduced  scattering  only  reconstructions.  Our  study  shows  that  TMODDM  provides  lower 
relative  error  than  the  one-level  MODDM  for  the  same  floating  point  operations  and  the 
relative  error  becomes  much  lower  as  the  width  of  the  overlapping  region  grows.  Similarly, 
TMSDM  provides  lower  objective  functional  values  and  higher  SNR  than  the  non-DD 
method  for  the  same  number  of  floating  point  operations,  in  all  experiments  including  the 
simultaneous  absorption  and  reduced  scattering  reconstruction.  If  parallel  computers  are 
used,  the  computational  efficiency  of  TMSDM  is  expected  to  be  further  enhanced. 

Finally,  the  local  convergence  properties  of  the  algorithms  do  not  change  when  they  are 
implemented  only  for  a  region  of  interest  (ROI).  Therefore,  if  an  ROI  is  identified  either  by 
a  priori  information  provided  by  a  secondary  imaging  modality  such  as  magnetic  resonance 
or  x-ray;  or  by  a  posteriori  information  obtained  from  the  coarse-level  solution,  the  optical 
coefficients  can  be  updated  only  in  the  sub-domains  covering  the  ROI,  providing  further 
reduction  in  computational  requirements. 
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Appendix  A.  The  proof  of  theorem  1 


Lemma  1.  Let  /x  =  (fia,  Ms)  ]!  —  fj.  +  Sp,  Sp  —  {Spa,  Sp'J-  lf\\Sp\\ 
for  some  e  >  0,  then  F"  satisfies 
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e\\Sp\\ 


VS^XV^ 
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Proof.  Let  G  and  G  be  Green’s  function  for  the  optical  coefficients  p  and  p,  respectively. 
Then,  we  get 
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For  the  norm  boundedness  of  IZ,  see  [21]. 
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Therefore,  using  (2.13),  (3.34),  and  (A. 2),  we  get 

Sfx,'  F"SiJ,  =  (7i5/r)'(7i5/r)  +  Sfi'  HbSfi  +  aSfi,'  A"Sfj,  (A. 2) 

>  (JSfiYiJSfx.)  +  aSf^‘ A''8fx  —  CHSfx.‘ Sfj.\b\  (A. 3) 

^  (kCa  -  CHCj\St£\)Six'St£.  (A.4) 

Thus,  we  have  proved  the  lemma.  □ 


The  inequality  (A.l)  is  called  the  local  strong  convexity.  With  this  property  and  the 
theorem  in  [5],  we  will  prove  theorem  1. 


Proof  of  theorem  1.  The  proof  of  theorem  1  is  based  on  the  proof  in  theorem  3.1  [5]. 
Let  fjL  =  (/Tfl,  and  5/x  =  (3/ra,  ^/x').  Let  the  restrictions  of  ,  y  =  /Xa,  /x^  to  the  non¬ 
overlapping  and  overlapping  sub-domain  be  W^/'^  and  for  p  =  I, ...  ,d,  respectively, 
such  that 


WY/'f  =  {x  e  VYiJx  =  0  on 
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where  y  =  f^a,  d's-  Then,  p  —  I, 


,  d  are  mutually  disjoint,  W^/'f  C  ^^d 


Since  we  did  not  consider  the  coarse-level  correction  step  in  this  theorem,  we  will  use  p  and 
d  instead  of  p  H-  1  and  d  +  \  \n  (3.31).  Let  /x”  be  the  solution  of  MSDM  after  n  sweeps  of 
sub-domain  correction  and  /x"'^?  be  the  solution  obtained  by  updating  /x”  with  the  solution 
estimates  obtained  on  the  sub-domain  . . . ,  f2“(/7  ^  d).  Define 
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(A.  8)  implies  that 
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where  { •,  •)  is  inner  product  in  the  space  y^°’''  x  y^*’^.  Using  (A.  1)  and  Taylor  expansion 
for  F,  we  get 
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V  —  /x"'^<i  in  (A. 10)  and  using  (A.9)  with  Vp  —  0,  we  get 
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Let  dn  —  F{(ji^)  —  F(/x),  then  the  above  equation  and  the  Cauchy-Schwarz  inequality  implies 
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for  all  T]  >  0.  Thus,  we  obtain 
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The  left-hand  side  of  (A.  14)  takes  minimum  value  |  <  1  when  rj  — 
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Using  the  mean-value  theorem,  (2.13fl),  (3.32),  (3.34a)  and  (A. 2),  we  can  estimate  do  as 
follows: 
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Abstract.  The  inverse  DOT  problem  involves  the  inversion  of  the  associated 
coefficient-to-measurement  operator  which  maps  the  spatially  varying  optical 
coefficients  of  turbid  medium  to  the  boundary  measurements.  The  inversion  of  the 
coefficient-to-measurement  operator  requires  the  computation  of  the  Frechet  derivative 
of  the  coefficient-to-measurement  operator.  In  this  work,  we  derive  the  Frechet 
derivatives  of  the  coefficient-to-measurement  operator  on  appropriate  normed  spaces, 
by  making  use  of  the  relationship  between  the  Frechet  derivatives  and  Born  expansion. 
In  this  context,  we  first  analyze  the  Born  expansion,  show  the  conditions  which  ensure 
the  existence  and  convergence  of  the  Born  expansion,  and  compute  the  error  in  the 
m-th  order  Born  approximation.  Then,  we  derive  the  m-th  order  Frechet  derivatives 
of  the  coefficient-to-measurement  operator  and  show  the  relationship  of  the  Frechet 
derivatives  to  the  terms  in  the  Born  expansion. 


I  corresponding  author  :  yazici@ecse.rpi.edu 


Born  expansion  and  Frechet  derivatives  in  diffuse  optieal  tomography 

1.  Introduction 


2 


Diffuse  optical  tomography  (DOT)  in  near  infrared  involves  determining  the  spatially 
resolved  optical  properties  of  a  turbid  medium  from  boundary  measurements  based  on 
a  photon  propagation  model.  For  deep  tissue,  frequency  domain  diffusion  equation 
with  Robin  boundary  condition  constitutes  a  good  model  for  photon  propagation. 
Coefficients  of  the  diffusion  equation,  namely  the  absorption  and  reduced  scattering 
coefficients,  represent  the  optical  properties  of  the  medium.  As  a  result,  the  inverse  DOT 
problem  involves  the  inversion  of  the  associated  coefficient-to-measurement  operator 
which  maps  the  coefficients  of  the  diffusion  equation  to  the  boundary  measurements. 
The  inversion  of  the  coefficient-to-measurement  operator  calls  for  the  computation  of 
the  Frechet  derivative  of  the  coefficient-to-measurement  operator.  Thus,  the  Frechet 
derivative  of  the  coefficient-to-measurement  operator  is  an  integral  part  of  the  algorithms 
developed  for  DOT  imaging. 

In  this  work,  we  derive  the  m-th  order  Frechet  derivative  of  the  coefficient-to- 
measurement  operator  in  various  normed  spaces  for  both  the  absorption  and  reduced 
scattering  coefficients.  The  Frechet  derivative  of  the  coefficient-to-measurement  operator 
is  related  to  the  Frechet  derivative  of  the  coefficient-to-solution  operator,  which  can  be 
interpreted  as  the  ratio  of  the  perturbation  in  the  photon  density  to  an  inhnitesimal 
perturbation  in  the  optical  coefficients.  Similarly,  the  Born  expansion  in  DOT  can 
be  regarded  as  the  Neumann  series  expansion  of  the  perturbed  photon  density,  given  in 
terms  of  the  unperturbed  photon  density  and  the  perturbation  in  the  optical  coefficients. 
Hence,  the  Frechet  derivatives  and  the  Born  expansion  are  closely  related.  In  this 
respect,  we  hrst  analyze  the  Born  expansion  and  investigate  the  conditions  which  ensure 
the  convergence  of  the  Born  expansion.  Next,  we  derive  the  Frechet  derivative  of 
coefficient-to-solution  operator,  and  then  extend  our  results  to  the  Frechet  derivative  of 
the  coefficient-to-measurement  operator. 

Before  we  give  the  technical  details  of  our  analysis  and  results,  we  will  describe  how 
our  work  differs  from  the  previously  reported  studies  in  the  general  inverse  scattering 
problem  and  particularly  in  DOT.  In  this  context,  we  present  an  extensive  literature 
survey  on  Born  expansion  and  Frechet  derivatives,  respectively,  and  give  an  ontline  of 
onr  analysis  and  hndings  regarding  the  Born  expansion  and  Frechet  derivatives  in  DOT. 

1.1.  Born  Expansion 

In  this  section,  we  describe  the  related  work  and  the  contribution  of  our  work  in  the 
area  of  Born  expansion. 

We  consider  the  following  aspects  in  our  Born  expansion  analysis:  First,  we  stndy 
the  validity  of  the  Born  expansion  by  deriving  bonnds  of  the  integral  operators  in  the 
Born  expansion.  In  this  context,  we  first  show  the  conditions  to  ensure  the  existence 
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of  the  Born  expansion.  Second,  we  show  the  restrictions  on  the  optical  coefficients 
that  guarantee  the  convergence  of  the  m-th  order  Born  approximation  and  compute  the 
error  in  the  m-th  order  Born  approximation.  Third,  we  analyze  the  Born  expansion 
not  only  for  the  perturbation  of  the  absorption  coefficient  but  also  for  the  reduced 
scattering  coefficient.  Fourth,  the  analysis  of  the  Born  expansion  is  done  for  the  n- 
dimension  for  any  n  >  2.  Fifth,  we  summarize  the  existence  and  singular  properties 
of  the  Robin  function,  which  is  the  kernel  of  the  integral  operator  for  each  term  in  the 
Born  expansion.  Note  that  we  use  the  term  Robin  function  for  the  Green’s  function  of 
the  diffusion  equation  to  stress  that  the  Green’s  function  satishes  the  (homogeneous) 
Robin  boundary  condition. 

Studies  on  the  Born  expansion  in  the  area  of  quantum  scattering  were  initiated 
with  the  establishment  of  the  Lippmann-Schwinger  equation  [9,  22].  The  beam  of 
particles  of  light  can  be  modelled  by  the  Schrodinger  equation.  Then,  the  scattered  held 
due  to  the  perturbation  in  the  refractive  index  can  be  represented  by  the  Lippmann- 
Schwinger  equation,  which  is  an  implicit  integral  representation  of  the  scattered  held 
solution.  Using  the  Lippmann-Schwinger  equation  recursively,  the  formal  Neumann 
series  expansion  of  the  scattered  held  with  respect  to  the  incident  wave  gives  the  Born 
expansion.  However,  in  quantum  mechanics,  there  are  relatively  few  systems  which  are 
of  physical  interest  and/or  can  be  solved  exactly.  Therefore,  approximation  methods 
such  as  Born  approximation  are  widely  used  [4,  30,  32,  34].  We  note  that  acoustic 
scattering  can  also  be  considered  as  a  case  of  quantum  scattering,  since  the  Helmholtz 
equation  is  a  special  case  of  the  Schrodinger  equation. 

The  analysis,  in  this  paper,  for  the  validity  of  the  Born  expansion  and  the  error 
in  the  Born  expansion  dihers  from  the  analysis  in  quantum  and  acoustic  scattering 
[3,  4,  6,  17,  18,  23,  29,  30,  32,  34,  35,  37]  in  the  following  aspects:  First,  in  these 
studies,  the  scattered  wave  is  considered  in  an  unbounded  domain  with  spatially 
constant  background  properties  of  interest.  Thus,  the  associated  Green’s  function  is 
explicitly  known.  However,  we  consider  the  Robin  boundary  condition  for  arbitrary 
bounded  domains  and  spatially  varying  background  optical  coefficients.  Therefore,  the 
existence,  singularities,  and  other  properties  of  the  Robin  function  are  not  known  a 
priori.  Although  the  Green’s  function  of  the  diffusion  equation  in  specihc  geometries 
with  specihc  optical  coefficients  is  known  analytically  [24,  25,  26,  27],  to  the  best  of 
our  knowledge,  studies  on  the  existence  and  singularities  of  the  Robin  function  for 
arbitrary  geometries  in  which  the  Robin  function  is  not  known  analytically,  have  not 
been  reported.  Thus,  we  study  the  singularity  properties  of  the  Robin  function  using 
[5,  36],  based  on  the  dehnition  of  the  Robin  function  given  in  [28].  Second,  in  quantum 
and  acoustic  scattering  theory,  only  the  perturbation  in  the  refractive  index,  which 
corresponds  to  the  absorption  coefficient  in  DOT,  has  been  considered.  In  this  work,  we 
consider  the  perturbation  with  respect  to  both  the  absorption  and  reduced  scattering 
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coefficients.  We  note  that  the  analysis  of  the  Born  expansion  for  the  reduced  scattering 
coefficient  requires  more  sophisticated  mathematical  machinery  as  compared  to  the 
analysis  of  the  Born  expansion  for  the  absorption  coefficient.  This  complication  results 
from  the  presence  of  the  gradients  of  the  Robin  function  and  the  unperturbed  photon 
density  in  the  Born  expansion  for  the  reduced  scattering  coefficient.  Third,  we  establish 
a  relationship  between  the  Born  expansion  and  the  Frechet  derivatives  of  the  coefficient- 
to-solution  operator. 

1.2.  Freehet  derivative 

In  this  section,  we  describe  the  related  work  and  the  contribution  of  our  paper  in  the 
area  of  Frechet  derivatives. 

A  number  of  studies  on  the  derivation  of  the  Frechet  derivatives  have  been  reported 
in  inverse  acoustic  scattering  problem  [3,  11,  16,  31]  and  in  electrical  impedance 
tomography  [12,  13].  In  these  studies,  Frechet  derivatives  are  either  given  by  the 
solution  of  partial  differential  equations  using  weak  formulation  [3,  11,  12,  16]  or  by  the 
solution  of  integral  equation  systems  [13,  31].  Although  these  studies,  for  example  [3],  is 
potentially  applicable  to  DOT,  most  researchers  in  DOT  use  the  perturbation  method 
and  the  hrst  order  Born  approximation  to  approximate  the  hrst  order  Frechet  derivative 
[2].  The  derivation  of  the  first  order  Frechet  derivative  is  straightforward;  however  the 
higher  order  terms  in  the  Born  expansion  are  usually  discarded  regardless  of  the  relative 
magnitude  of  the  higher  order  terms  with  respect  to  the  first  order  terms.  Ye  et.  al 
[38]  derived  the  Frechet  derivative  of  the  coefficient-to-measurement  operator  using  the 
perturbation  method  without  using  the  first-order  Born  approximation.  However,  in 
that  work,  the  Robin  function  is  assumed  to  be  bounded,  which  is  not  valid.  In 
contrast,  in  our  work,  we  showed  and  used  the  argument  that  the  convolution  of  the 
Robin  function  and  any  function  is  bounded  [5,  36].  Dierkes  et.  al  [7]  derived 
the  first  order  Frechet  derivative  for  DOT,  where  a  Dirichlet  boundary  problem  with 
zero  source  is  considered  for  the  derivation,  which  is  different  from  the  model  used  in 
this  paper. 

In  this  paper,  we  hrst  derive  the  Frechet  derivatives  of  the  coefficient-to-solution 
operator,  then  we  obtain  the  Frechet  derivatives  of  the  coefficient-to-measurement 
operator  by  change  of  variables.  The  approach  followed  in  deriving  the  Frechet 
derivatives  in  this  paper  differs  from  the  approaches  in  [2,  3,  7,  11,  12,  13,  16,  31,  38]  in 
the  following  aspects:  First,  we  compute  the  Frechet  derivatives  of  any  order.  Second, 
we  use  various  normed  spaces  for  the  dehnition  of  the  Frechet  derivatives,  such  as 
Lebesque  space  1  <  p  <  oo,  the  hrst  order  Sobolev  space  W^,l  <  p  <  oo,  and 
the  weighted  Sobolev  space  IF°l“  for  two  dimension  and  for  dimension  higher 

than  2,  where  r  is  a  point  in  the  domain  of  interest.  Third,  we  consider  both  the 
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absorption  and  the  reduced  scattering  coefficients  for  the  derivation  of  the  Frechet 
derivatives.  Fourth,  we  show  that  the  m-th  order  Frechet  derivatives  are  equal  to 
the  m-th  order  terms  in  the  Born  expansion  up  to  constant  multiples,  whereas  other 
approaches  [3,  7,  11,  12,  13,  16,  31],  do  not  provide  the  higher  order  derivatives.  Even 
though  the  approaches  in  [3,  7,  11,  12,  13,  16,  31]  can  be  used  to  derive  the  higher 
order  derivatives,  the  approach  presented  in  this  paper  has  several  advantages  over  the 
previous  approaches:  First,  we  compute  the  m-th  order  Frechet  derivatives,  which  can 
not  be  easily  derived  by  the  previous  approaches,  by  showing  that  they  are  equal  to  m! 
times  the  m-th  term  in  the  Born  expansion.  Although  Born  expansion  is  well-known  in 
quantum  and  acoustic  scattering  and  DOT  [3,  6,  17,  18,  23,  25,  29,  35,  37],  to  the  best  of 
our  knowledge,  there  has  not  been  a  study  to  relate  the  higher  order  Frechet  derivatives 
to  the  terms  in  Born  expansion.  Note  that  the  inclusion  of  the  higher  order  Frechet 
derivatives  improves  the  resolution  of  the  reconstructed  optical  coefficients  of  DOT  [25] 
and  the  bounds  of  the  higher  order  Frechet  derivatives  can  be  utilized  in  the  analysis  of 
the  numerical  DOT  reconstruction  algorithms  [21].  Second,  the  recursive  structure  of 
the  Born  expansion  makes  it  possible  to  bound  the  m-th  order  Frechet  derivative  in  a 
variety  of  normed  spaces  by  the  m-the  multiple  of  the  bound  of  the  hrst  order  Frechet 
derivative. 

The  rest  of  our  paper  is  organized  as  follows:  In  section  2,  we  provide  a 
mathematical  formulation  of  DOT.  The  dehnition,  existence,  and  singular  properties 
of  the  Robin  function  are  given  in  Section  3.  The  validity  of  the  Born  expansion  and 
the  error  analysis  due  to  the  m-th  order  Born  approximation  is  given  in  Section  4. 
In  Section  5,  we  show  that  the  Frechet  derivatives  of  coefficient-to-solution  operator 
are  given  by  the  terms  in  the  Born  expansion.  Section  6  summarizes  our  results  and 
conclusion.  The  paper  concludes  with  two  appendices  providing  proofs  for  Lemmas  3.3 
and  4.6. 

2.  Photon  Diffusion  Equation  in  Frequency  Domain 

We  model  the  propagation  of  light  in  bounded  turbid  media  by  the  photon  diffusion 
equation  with  Robin  boundary  condition.  In  the  frequency  domain,  the  photon  diffusion 
equation  is  given  by: 

—  V  ■  (kV<F) -I- (/ia  +  — )‘h  =  g  in  D  (2.1a) 

c 

<h -|- 2az/ ■  =  0  on  (9D  (2.1b) 

where  D  is  a  Lipschitz  domain  in  M”,  n  =  2,  •  •  • ,,  dVl  is  its  boundary,  c  is  the  speed  of 
light,  uj  is  the  angular  frequency  of  the  source  g,  v  is  the  unit  outward  normal  vector  on 
the  boundary,  $  is  the  photon  density,  and  Ua,  and  n  =  ^7 — xvr  are  the  absorption. 
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reduced  scattering,  and  diffusion  coefficients,  respectively,  and  the  constant  a  accounts 
for  the  the  refraction  index  mismatch  at  the  boundary,  where  we  assume  a  is  a  constant 
and  K,,  p,a,  fi's  are  scalar  functions  satisfying 

0  <  L  <  K,Ha,l^'s,Ci  <  U.  (2.2) 

for  some  positive  constants  L  and  U. 

Let  us  define  the  partial  differential  operators  M.  and  J\f  defined  on  the  function 


space  on  and  respectively,  as  follows: 

=  -V  ■(KV'^)  +  (iXa  +  —)'^  for e  (2.3a) 

c 

FfT  =  T  +  2au-  (kVT)  for  T  G  (2.3b) 

Then,  (2.1)  is  represented  by 

=  g  in  (2.4a) 

=  0  in  (9f2.  (2.4b) 


If  the  partial  differential  operator  is  with  respect  to  r,  we  will  use  the  notation 
along  with  M.  and  M . 

In  the  following,  we  show  the  existence  and  uniqueness  of  the  weak  solution  of  (2.1). 

Proposition  2.1  Let  q  G  Then  there  is  a  unique  weak  solution  <h  G  H^{VL) 

satisfying  (2.1). 


Proof  By  multiplying  both  sides  of  (2.1a)  with  the  complex  conjugate  u  of  u  G  P[^{Q), 
integrating  over  and  using  integration  by  parts  and  (2.1b),  we  get  the  following  weak 
formulation: 


6($,u)  =  l{v), 
b{u,v)  =  /  kVuVvP 

Jn 


iuj  _ 
— )uv  + 


(2.5) 


where  c  is  the  complex  conjugate  for  the  complex  number  c  G  C.  The  sesquilinear  form 


b  :  H^{n)  X  H^{n)  - 

C,  for  complex  number  space  C,  satisfies 

iR.{b{u,  u)) 

> 

(2.6a) 

\b{u,v)\ 

< 

(^  +  “)  l  ^  IK^I  lh^lLL2(ao) 

< 

II“IIhi(o)  II^II//i(o)  > 

(2.6b) 

m\ 

< 

1  5'ILh'-1(0)  I  '^ILh'I(O)  > 

(2.6c) 
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where  stands  for  the  real  part  of  complex  valued  function  The  last  inequality 
in  (2.6a)  comes  from  Poincare  lemma  and  the  last  inequality  in  (2.6b)  comes  from  the 
trace  formula.  Using  (2.6),  there  is  a  unique  solution  of  (2.5)  by  Lax-Milgram  lemma 

|10].  □ 

In  the  rest  of  this  paper,  we  will  consider  two  cases  i)  O'  G  and  ii)  q  is 

a  Dirac  delta  function,  i.e.  q  =  5{-  —  ro),r°  G  D.  Proposition  2.1  shows  that  when 
q  G  we  get  <h  G  The  solution  of  (2.1)  when  q  =  S{-  —  r°)  is  called  the 

Robin  function.  Dirac  Delta  function  is  not  contained  in  since  it  is  contained 

in  H^{Q)  if  and  only  if  s  <  —  |  by  [8].  Thus,  we  can  not  conclude  from  Proposition 
2.1  that  the  Robin  function  is  contained  in  Rigorous  dehnitions  of  the  Dirac 

Delta  function  and  the  Robin  function  require  the  use  of  distribution  theory  [14,  15]. 
To  avoid  technicalities  involved  in  the  distribution  theory,  we  shall  follow  the  concepts 
in  [28]  and  use  Levi  functions  to  develop  a  rigorous  dehnition  of  the  Robin  function. 


3.  The  Robin  function 


The  solution  of  (2.3a)  when  g(-)  =  S{-  —  r^)  is  called  the  Green,  Neumann,  or  Robin 
function  depending  on  whether  the  operator  J\f  is  or  (2.3b),  respectively. 

Sometimes  Green,  Neumann,  and  Robin  functions  are  simply  called  the  Green  function 
without  any  regard  to  the  boundary  conditions.  In  this  paper,  however,  we  will  use  the 
term  Robin  function. 

First,  we  introduce  the  following  function  H  which  is  associated  with  the  dehnition 
of  Levi  functions  and  the  Robin  function. 

1  I  /1 2 — n  -. r) 

— ^  \r  —  r  \  n  >  A 

(3.7) 


Ufr  r'\  —  }  {n-2)uJnK{r') 

n[r,r)-<^ 

uJ2Kir') 


|r  —  r 

2d 


n>3 
n  =  2, 


where  r,r'  G  M”,  Un  is  the  hypersurface  area  of  the  unit  sphere  in  M”,  and  d  = 
sup,,!  |r^  —  r^|.  The  function  H  satishes 

V,  ■  {K{r')VrH{r,  r'))  =  0  for  r  G  D  \  {r'}.  (3.8) 

We  will  discuss  other  relevant  properties  of  the  function  H  at  the  end  of  Section  3.1, 
after  introducing  some  function  spaces  containing  iL(r,  •)  and  H{-,r). 


3.1.  Funetion  spaees 

In  this  section,  we  will  introduce  Sobolev  spaces  and  weighted  Sobolev  spaces  and  show 
the  boundedness  of  the  function  if,  which  is  closely  related  to  the  boundedness  of  the 
Robin  function.  To  simplify  our  notation,  for  the  rest  of  this  paper,  we  will  drop  D  from 
the  notation  of  the  function  spaces.  For  example,  we  will  use  instead  of  L^(D)  for 
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integrable  functions  on  Let  us  define  multi  index  f3  =  (/di,  132,  -  "  ,  Pn)  for  nonnegative 

integers  pi,i  =  1,  ■  ■  ■  ,n,  such  that  \p\  =  + p2-\ - h Pn,  and  Dh(f)  =  aiSrl - 

In  the  following,  we  introduce  the  Sobolev  spaces  and  the  associated  norms  given 
by  [1]: 

W^’P 

\i=o\p\=i  / 

where  k  =  1,2,  ■  ■  ■  ,p  >  1  and 

=  {0  e  11011^^  =  sup  0  <  CX)}, 

II/Loo  =  {0  e  L“|  ||0||^fc,oo  =  max  max  ||L)^0||^^  <  oo}. 

L  —  LI  ^  ‘  ‘  ‘  ^  rC  I  fj  I  — (j 

and  W^’°^  are  Banach  spaces  and  C  c  C  for  p  >  g  >  1, 

since  hi  is  bounded.  In  particular,  is  a  Hilbert  space  and  we  denote  it  by  and 
W^’P  :=  LP  is  called  Lebesque  space. 

Similarly,  the  weighted  Sobolev  spaces  and  for  tq  G  H,  a  real  number 

a,  and  the  associated  norms  are  given  by: 


=  {01  Il0llw"’“  •=  \r-roP  <  oo} 


ro,' 

rk,oo 


1=0, ■■■,k  \p\=l 


^ro’,log  “  (01  Il0llw'‘’7 

’  ^  ro,log 


(90 

max 

\  2=1, •••,n 

dri 

T  T  7  /c  1  ,  OC 
Wri^,n’ 

0(-) 


log(2d/|  •  -rol) 


(3.9a) 


<  cx)}  (3.9b) 


Then,  the  following  holds  for  110,^'^,  and  the  Sobolev  spaces: 


yj/fc,oo  ^ 

^  a  a- k<0, 

Wk,oo  ^  y^k,^  ^  if  ^  >  0, 

Wk,oo  ^  ^rk,c^^  a  a- k>n. 


for  k  =  0,1,2,  ■■  ■. 

The  following  proposition  shows  that  and  are  Banach  spaces. 


Proposition  3.1  Wfff  and  Wppffg  for  k  =  0,1,2,  are  Banach  spaces  with  norms 
given  in  (3.9a)  and  (3.9b),  respectively. 
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Proof  We  can  verify  that  ||■||„,fe.tx>  is  a  norm  by  nsing  the  dehnition  of  the  norm 
follows:  Let  <p  G  then  (r  —  G  for  \(3\  <  k. 

Thus,  triangular  inequality  in  Wfff  comes  from  the  triangular  property  in  L°^ .  If 
II0II  ^^A;,oo  —  0^  then  (fir)  =  0  for  r  ^  Tq,  which  means  that  0  =  0  almost  everywhere  in 

n.  ||c0||  p;^^fc,00  -  M II0II  y^fc,00  can  be  derived  in  a  straightforward  manner.  Thus,  Wfff 

is  a  normed  space.  Assume  that  {0p}p=i,...  is  a  Cauchy  sequence  in  that  is, 

{(r  —  ro)"'““0p}p=i,...  G  Since  is  a  Banach  space,  there  is  a  convergent  sub¬ 
sequence  ...  and  ip  G  such  that  ||(r  —  ■ro)"'““0p;  —  'ip\\y^k,oo  0  as  I  ^  oo, 

i.e.  ||0p,  —  (r  —  ro)"“""0||^yfe,oo  — 0  as  /  — *>  0.  Since  (r  —  ro)"“""0  ^  we  have 

proved  that  is  complete,  and  thus  is  a  Banach  space.  Using  a  similar  argument, 

it  is  straightforward  to  prove  that  are  Banach  spaces  using  log(p3^)  >  log 2  for 

all  r,  ro  G  fl.  □ 


For  further  information  about  the  weighted  Sobolev  spaces,  see  [20].  Noting  that 
H{r',-)  has  singularities  only  at  r'  with  order  0{\  ■  — we  get  the  following 
properties  of  the  function  H{-,r')  : 


H{-,r')  G  C'" 

=  (M-\{r'}), 

(3.10a) 

77 

H(-,r')  G  if  and  only  if  1  <  p  <  - ; - , 

n  +  k  —  2 

(3.10b) 

H (;,■')  n  =  3,4,---, 

(3.10c) 

ir-y)  e  Hbfe,  n  =  2, 

(3.10d) 

*  =  i, 

(3.10e) 

oriOrj 

(3.10f) 

Following  (3.10b),  we  can  write 

Hf,F)  G  PP 

if  and  only  if  1  <  p  <  cx)  when  n  =  2 

(3.11a) 

G 

if  and  only  if  1  <  p  <  2  when  n  =  2 

(3.11b) 

Hf,r')  G  PP 

if  and  only  if  1  <  p  <  3  when  n  =  3 

(3.11c) 

H{-,r')  G  W^’P 

3 

if  and  only  if  1  <  p  <  -  when  n  =  3 

(3. lid) 

H{-,F)  i  W^’P 

for  n  =  2,  3,  •  •  ■ 

(3.11e) 

By  simple  computation,  we  can  obtain  the  following  bounds  for  H  instead  of  (3.10c)- 

(3.10f): 

■  ^(„_2KL  J"  =  3,4,... 

(3.12a) 
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r,log  T,log 


dHf,  r') 

dH{r]  •) 

dvi 

dr-i 

1 

<  - 

UJ2L 

< 

r'  ,1 


if  n 

1 

iOnL 


d^H{-,r') 

dr'idr-j 


r'  ,0 


d‘^H{rf  ■) 

dr'idr-j 


< 


r'  ,0 


n 

UJyiL 


2 


(3.12b) 

(3.12c) 

(3.12d) 


for  all  i,j  =  1,  •  •  •  ,n.  Note  that  if  k  is  sufficiently  smooth,  (3.10)  and  (3.12)  hold  for 
//(r,  ■)  as  well. 


3.2.  Levi  funetion  and  Robin  funetion 

In  this  section,  we  provide  precise  dehnitions  of  the  Robin  function  and  investigate  the 
properties  of  the  Robin  function.  To  do  that,  the  dehnitions  and  properties  of  the  Levi 
function  will  be  introduced  following  the  approaches  in  [28] . 

Definition  3.2.  [Levi  funetion]  A  function  L(r,  r'),r,  r'  G  is  called  a  Levi  function  if 
L(-,r')  e  C^(f2  \  {r'}),  and  L(-,r')  —  H{-,r')  G  some  constant  A  >  0,  where 

A  is  the  order  of  the  Levi  function. 


Note  that  H{r,  r')  is  a  Levi  function,  and  H{r',  r)  is  also  a  Levi  function  of  order  1  if 
K  G  .  Thus,  if  L(r,  r')  is  a  Levi  function  of  order  A,  then  L(r',  r)  a  Levi  function  of 
order  min(A,  1).  Before  introducing  the  properties  of  the  Levi  function  in  the  following 
lemma,  let  us  state  some  known  results. 

Let  A(r,  •)  G  a  >  0  and 

u{r)  =  /  K{r,r')(i){r')dr' . 

Jn 

Then  the  following  facts  are  known: 

•  When  K{r,  •)  G  a  >  0,  there  exists  a  constant  Ci  =  Ci{a,p,  q)  which  depends 

on  a,p  and  q  such  that  the  following  holds  [36]: 

\\u\\l,  <  C'i(a,p,g)sup||A:(r,-)IUy-  UWlp 

reo 

72  72 

for  0  <  a  <  —  <  a  H — .  (3.13) 

p  q 

If  TL  is  a  Levi  function,  then  we  can  take  a  <  2  for  n  >  3  and  a  <  2  for  n  =  2, 
and  if  iL  is  a  derivative  of  a  Levi  function,  we  can  take  a  <  1.  However,  if  iL  is  a 
second  derivative  of  a  Levi  function,  we  must  choose  a  <  0  and  hence  can  not  use 

(3.13). 
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•  When  the  kernel  is  a  Levi  function  and  p  <  the  maximum  value  of  q  in  (3.13) 
is  taken  as  follows  : 

Tl 

<  C'i(2,p)sup||iL(r,-)||^o.^-  UWlp  for  p  <  -  (3.14a) 

Tl 

IImII^?  <  C'i(e)d"sup||iL(r,  OlUo-  ||0||^j,  for  p  =  -  (3.14b) 

reo  ^ 

by  taking  a  =  2  for  p  <  f  and  a  =  2  — e  for  p  =  f  where  e  is  some  constant  between 
0  and  2,  C'i(a,p)  =  C,{a,p,  and  Ci(e)  =  C'i(a  -  e,  f ,  f ). 

•  When  iL  is  a  Levi  function  and  p  >  |,  we  have  the  following  inequality: 

Tl 

\\u\\^o  <C2{p)sup\\K{r,-)\\w\^\\<P\\LP  foi’P>  (3-15) 

reo  ^ 

where  C2(p)  is  a  constant  depending  on  p  [36]. 

•  When  K  =  ,  there  exists  a  constant  C3  [5]: 

\\u\\lv<^U\\lv  (3-16) 

Although  the  constants  in  this  paper  may  depend  on  n,  we  will  neglect  this  dependence 
on  n  unless  it  is  needed.  Using  (3.13)  and  (3.16),  we  obtain  the  following  lemma  about 
the  properties  of  the  Levi  function.  Note  that  these  are  also  the  properties  of  the  Robin 
function. 


Lemma  3.3  Let  Lf,r')  be  a  Levi  function  of  order  A  >  0  and  assume  k,  G  and 
dQ  G  Let  fj  G  L^,p  >  1  and  v  be  given  by 

v{r)  =  /  L{r,r')ijj{r')dr' .  (3.17) 

Jq 

Then,  ^,i  =  are  absolutely  continuous  on  one-dimensional  line  parallel  to 

Tj-axis,  G  LP,  i,j  =  1,  •  •  • ,  n,  and  the  following  bounds  hold: 


\LP 


\LP 


dv 


dri 

d^v 


dridrj 


<  C'i(min  r-,2yp,p)sup  ||L(r,-)||^^o.^  n  >  3,  (3.18a) 

\P  J  r-eo 

<  C'i(min(-,2)  -  e,p,p)  sup  ||L(r,  ll^ll 

\V  )  r-eO  "’2 


hp 


LP 


n 


n  =  2, 0  <  e  <  2, 
dL{r\ 


<  Ui(min  (  — ,  1  ,p,p)  sup 

\p  /  r'eo 


dri 


ll^ll 


\LP 


LP 


< 


Ci(A,p,p)  sup 


i  =  !,■■■  ,n, 
d^(L-ff)(r',-) 


dridrj 

hi  = 


C's  1  ■ 

+  —  H - 

L  nL 


m 


LP 


(3.18b) 


(3.18c) 


(3.18d) 
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where  Ci  and  C2  are  introduced  in  (3.13)  and  (3.15),  respectively.  Furthermore,  the 
following  equations  hold: 


or 

^(?^)  =  /  -^{r,r')f){r')dr' , 


dv 
dri 

d^v 

dridrj 


In  i 


ir  = 


nK[r] 


-'fir)  +  lim 


d^L 


0  Jn\B{^)  ^Fdr-j 
Mrv(r)  =  —ip(r)  +  /  MrL{r,r')il){r)dr. 

Jn 

If  n  e  then 

v{r)  =  [  {v{r')Mr'L{r,r')  —  Mv{r')L{r,r')dF 


(r,  r')ijj[r')dr  , 


idn 


{J\fv{r')L{r,  r')  —  v{r')J\frL{r,  r'))dS{r'). 


(3.19a) 

(3.19b) 

(3.19c) 


(3.20) 


Proof  :  See  Appendix  A.  □ 


Definition  3.4  [Robin  funetion]  A  Levi  function  R  of  order  A  >  0  which  is  a  solution 
of  the  equations 


MrR{r,  r')  =  0  for  r  G  n  \  {r'}  (3.21a) 

J\frR{r,  r')  =  0  for  r  e  (9r2  \  {r'}  (3.21b) 

is  called  a  Robin  function.  Note  that  M.r  and  Mr  are  the  complex  conjugate  operators 
for  Adj.  and  A/).,  respectively. 

A  few  existence  theorems  of  Robin  functions  can  be  found  in  Section  19  and  Section 
22  in  [28].  For  the  rest  of  the  paper,  we  assume  that  the  Robin  function  exists  for 
n,  rc,  and  /ia-  Note  that  if  the  Robin  function  exists,  it  is  unique.  A  Levi  function 
which  satishes  (3.21a)  but  not  necessarily  (3.21b),  is  called  the  fundamental  solution. 
If  rc  G  and  pia  £  C^’^,  there  exist  fundamental  solutions  for  AI  in  hi  by  Theorem 
19. VIII  and  Section  22  in  [28].  However,  even  though  fundamental  solutions  exist,  these 
solutions  are  not  unique.  For  example,  iL  is  a  fundamental  solution  for  (2.1)  when 
/ia  =  0,  a;  =  0,  and  rc  is  a  constant. 

In  the  following,  we  investigate  the  properties  of  the  Robin  function. 

Lemma  3.5  Let  Rf,r')  be  a  Robin  function  of  order  A  >  0,  rc  G  and  dQ  G 
Let  fj  G  LP,p  >  1,  and  v  be  given  by 
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Then  (3.18),  (3.19),  and  (3.20)  hold  replacing  the  Levi  function  L  with  the  Robin 
function  R.  Furthermore,  we  get  the  following  equations  for  v  with  the  Robin  function 
as  the  kernel; 


Mrv(r) 

v(r) 

v(r) 
R(r,  r) 


-f{r), 

—  f  R{r,r')Mv{r)dr'  +  f  R{r,r)J\fv{r')dS{r'), 
Jn  Jan 

if  n  e  W^’P, 

I  R{r,r')q{r')dr'  if  v  satishes  (2.4)  and  q  G 
Jn 

R{r',  r). 


(3.23a) 

(3.23b) 

(3.23c) 

(3.23d) 


Proof  (3.18),  (3.19),  and  (3.20)  hold  because  a  Robin  function  is  also  a  Levi  function. 
(3.23a)  is  derived  by  using  the  dehnition  of  the  Robin  function  and  (3.19c).  Using  (3.20), 
we  derive  (3.23b)  and  (3.23c).  (3.23d)  is  induced  from  the  fact  that  the  adjoint  operator 
of  Ai  is  the  complex  conjugate  of  At  and  Theorem  10.1.  in  [28].  □ 


4.  Born  expansion 

In  this  section,  we  first  define  the  Born  expansion  in  the  normed  spaces  introduced 
in  Section  3.1  and  discuss  the  validity  of  the  Born  expansion.  Next,  we  compute  the 
error  in  the  m-th  order  Born  approximation  using  the  inequalities  developed  in  Section 
3.2.  In  Section  4.1,  we  analyze  the  Born  expansion  when  both  the  absorption  and  the 
diffusion  coefficients  are  perturbed.  Similar  analysis  is  applied  to  the  Born  expansion 
when  both  the  absorption  and  reduced  scattering  coefficient  are  perturbed.  In  Section 
4.2.,  we  analyze  the  Born  expansion  when  the  diffusion  coefficient  is  fixed  and  only  the 
absorption  coefficient  is  perturbed. 

Assume  that  (k,  pa)  is  changed  into  (k,  fta),  and  let  6k,  =  k,  —  k,  6p,a  =  fa  —  h'ai  and 
=  0  in  some  neighborhood  of  dVL.  Let  the  solution  of  (2.1)  for  the  optical  coefficients 
(k,  fa)  be  <h  such  that 

-V  ■  (kV$)  +  {fa  +  — )$  =  q  in  fl, 

c 

<h  +  2aK^—  =  0  on  (912 

OK 

Inserting  H  =  k  +  5k  and  fa  =  lVa  +  dfia  into  (4.24),  we  get 

-V  ■  (kV$)  +  ina  +  — )$  =  g  +  V  ■  ((5kV$)  -  6iaa^  in  12,  (4.25a) 

c 

<9$ 

<F  +  2aK— —  = 

OK 


(4.24a) 

(4.24b) 


0 


on  (912.  (4.25b) 
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Using  (3.23c),  <l>  can  be  represented  by 

$(r)  —  $(r)  =  (7^<l>)(r),  (4.26) 

where 

(7^^)(r)=7^(<5«;,<5/iJ^(r)  =  (7^l^)(r)  +  (7^2^)(r), 

(7^l^)(r)  =  7^l((5/ra)^(r)  =  -  /'<5/i,(r')i?(r,r')^(r')dr', 

Jn 

(7^2^)(r)  =  7^2((5fi;)^(r)  =  -  [  5K{r')\/R{r,r')\/^ir')dr'. 

Jq 

Note  that  in  the  derivation  of  (4.26),  (3.23c)  was  used.  Thus,  (4.26)  holds  when  <h  G  L^. 
Since  C  and  the  Robin  function  is  contained  in  for  all  p  >  1,  (n  =  2)  and 
1  <  P  <  {n  >  3),  where  n  is  the  space  dimension,  (4.26)  holds  at  least  for  the 
Robin  function  and  functions.  Using  (4.26)  recursively,  we  obtain  the  formal  Born 
expansion: 

$  =  $(0)  +  $(1)  +  $(2)  _ ^  <|,M  5(m+i)^  (4.27) 

where 

$(0)  =  (|)^ 

$(b  =  7^$ 

$(2)  =  7^7^$ 

$(”^1  =  77"^$ 

The  formal  expansion  (4.27)  exists  if  and  only  if  77^<h, /c  =  is  dehned.  We 

provide  the  following  dehnition  related  with  the  expansion  (4.27). 

Definition  f.l[m-th  order  representation  and  infinite  order  representation  with  index  M] 
The  integral  operator  77  has  an  m-th  order  representation 

Bq  ^  Bi  ^  B2  ^  ^  Bjn-  (4.28) 

if  there  are  normed  spaces  Bk^k  =  such  that  lZ{Bk-i)  C  Bk  for  all 

k  =  1,  ■  ■  ■  ,m. 

We  call  that  the  operator  77  has  an  inhnite  order  representation  with  an  index  M 

Bq  ^  Bi  ^  B2  ^  B3  ^  ^  Bm  — ^  Bm  — >  ■  ■  ■  (4.29) 

if  there  are  M  +  1  normed  spaces  Bq,  Bi,  ■  ■  ■ ,  Bm  such  that  lZ{Bk-i)  C  Bk  for  all 
/c  =  1,  •  •  • ,  M. 
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If  IZ  has  m-th  order  representation,  we  define  and  as  follows: 

+  <|>(h  +  $(2)  _ ^  ^(m)  (4.30a) 

=  $(0)  +  $(1)  <|,(2)  _ ^  <|,M_  (4.30b) 

If  the  operator  TZ  has  m-th  order  representation,  then 

=  ■■■  =  =  ^.  (4.31) 

by  (4.27).  If  TZ  has  inhnite  order  representation  with  an  index  M,  then  (4.31)  holds  for 
all  m  >  1.  If  we  define  as 

E°°^  =  <|>(°)  H _ h  H - ,  (4.32) 

then  E^^  G  Bm  and  we  can  easily  show  that 

E^  =  E^  =  ■■■  =  E^  =  ■■■,E^^  =  ^  (4.33) 


Furthermore,  if  the  operator  TZ  has  an  inhnite  order  representation,  we  have  the  following 
proposition; 


Proposition  f.2  Assume  that  the  operator  TZ  has  an  inhnite  order  representation  with 
an  index  M.  If  {E"^^}m=M,M+i,---  converges,  the  limit  is  E°°^  G  Bm-  The  necessary 
and  sufficient  condition  for  F™*!*,  m  =  M,  M  -|-  1,  •  ■  ■  to  converge  to  is 


lim 

/c— >oo 


p^M+k^ 


Bm 


0. 


The  sufficient  condition  for  (4.34)  is 


(4.34) 


(4.35) 


Proof  Since  =  77"^+^$,  (4.34)  is  the  necessary  and 

sufficient  condition  for  to  converge  to  E^^.  If  <  1,  then 

ll^''  Wbm  -  ll  Wbm^BmW  Wbm 

s  II’^"'0||b„  -  0  as  i  ^  oo.  (4,36) 


□ 


Definition  f  .3 [m-th  order  Born  approximation  and  (m-th  order,  infinite  order)  Born 
expansion]  is  called  the  m-th  order  Born  approximation  and  F"^$,F'^$  are 

called  the  m-th  order  and  inhnite  order  Born  expansion,  respectively.  F™<l>,  F“$  is 
just  called  Born  expansion,  since  they  are  equal  by  (4.33). 

Using  Proposition  4.2,  we  investigate  the  following  questions  about  Born  expansion 
and  Born  approximation: 
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•  When  does  the  inhnite  order  Born  expansion  exist?  In  other  words,  is  there  an 
inhnite  order  representation  with  an  index  M  for  the  operator  TZ  such  that 

Bo  — ^  -Bi  — >  -02  — >  -83  — ^  ■  ■  ■  — ^  Bm  — ^  Bm  — >  ■  ■  ■  (4.37) 

as  in  (4.29). 

•  Assume  that  there  exists  an  inhnite  order  representation  with  an  index  M  (4.37) 
for  the  operator  77.  By  Proposition  4.2,  (4.34)  and  (4.35)  are  the  necessary 
and  sufficient  condition  and  the  sufficient  condition,  respectively,  for  the  Born 
approximations  to  converge  to  the  Born  expansion  E°°.  Then,  what  are  the 
conditions  on  Sfia  and/or  Sn  for  the  operator  IZ  to  satisfy  (4.35)? 

•  Assume  that  converges  to  B°°.  Then,  what  is  the  error  in  the  m-th  order 

Born  approximation  (m  =  1,  2,  •  •  •),  that  is,  what  is  the  norm  bound  for  = 

^00  _  pm^ 

Although  it  is  possible  to  compute  the  error  in  the  Born  approximation  when  B“  does 
not  exist  or  B”^  does  not  converge  to  B°°,  we  will  treat  the  case  when  E°^  exists  and 
B™  converges  to  B°°.  In  the  following  sections,  we  first  relate  77  to  the  inhnite  order 
representation,  then  we  argue  about  the  condition  on  the  optical  coefficients  for  the 
norm  of  77  to  be  less  than  1.  Finally,  we  compute  the  error  in  the  m-th  order  Born 
approximation  with  respect  to  the  Born  expansion. 

f.l.  The  Born  expansion  when  both  the  diffusion  and  absorption  eoeffieients  are 
perturbed 

In  this  section,  we  treat  the  Born  expansion  when  both  the  diffusion  and  absorption 
coefficients  are  perturbed.  By  Proposition  4.2,  we  need  to  dehne  the  operator  77 
recursively  to  dehne  the  Born  expansion,  which  requires  the  dehnition  for  V77.  The 
kernel  of  the  integral  operator  V77i  is  the  derivative  of  Robin  function  which  is  a 
weakly  singular  kernel  and  contained  in  However,  the  kernel  of  V772  is  the  second 

derivative  of  Robin  function  which  is  a  hyper  singular  kernel  and  is  not  integrable.  Note 
that  the  treatment  of  integral  operators  with  hyper  singular  kernels  is  more  difficult  as 
compared  to  the  treatment  of  integral  operators  with  weak  singular  kernels  [19]. 

To  do  a  quantitative  analysis,  we  hrst  dehne  the  following  bounds  for  the  Robin 
function 


S{n)  : 

=  sup  1  B(r, 

■)  II —  3, 

(4.38a) 

refi 

r,2 

S{n)  : 

=  sup  1  B(r, 

■)l  ^  ~ 

(4.38b) 

reel 

r,log 

T(n)  : 

=  sup  1  (B  - 

r,2 

(4.38c) 
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Lemma  4-4  and  77-2  are  bounded  with  respect  to  norm: 


ll^hall 

iik2||„,.  ,P^Wi,P  ^  C*5  ||(5k|| 


(4.39a) 

(4.39b) 


where 


C4  =  S'(n)  max  (min  2J  ,p,p),  Ci(min  ij  ,p,p)J  n>3,  (4.40a) 

C4  =  S'(n)  max  (2  —  e,p,p),  Cl  (min  ,p,p)^  n  =  2,  0  <  e  <  2(4.40b) 

C5  =  Cl  (min  ,P,p)*S'(n)  +  Cl  (A,  p,p)T(n)  +  (4.40c) 


Proof  Let  fj  G  then  by  (3. 18a), (3. 18b),  and  (3.18c), 


i?(r,  r')6fia{r')'f{F)dr' 


R{r,  r')Sp,a{r')f!{r)dr 


dR{r,  r') 
drj 


5pia{j')'f(j')dr 


<  Ci(min(n/p,2),p,p)  \\R{r,-)\\^p  W^LaWL--  Mlp 

LP 

n  >3,  (4.41a) 

<  Ci(2/p  -  e,p,p)  \\R{r,  ||^ha|lioo  M^p 

(4.41b) 

ll'^h'dll  L°o  llt/’llip 


LP 


<  Ci(min(n/p,  l),p, p) 


n  =  2,  0  <  e  <  2, 
dR{r,  ■ 


LP 


dr-i 

i  =  1,  •  •  • ,  n. 


LP 


(4.41c) 


(4.39a)  is  derived  from  (4.41)  by  dehning  C4  as  in  (4.40a)  and  (4.40b).  Using  (3.18c), 
(3.18d),  and  (3.23d),  we  get 


f  dR{r,F)  ,^dilj{r')  , 


LP 


f  dRir^r)  ,^di){r')  , 


LP 


<  Ci(min(n/p,  l),p,p) 


dR{r, 


dri 


pKlI^oc  ||'?/’||iyi.p  n  >  3,i  =  1,  •  •  •  ,n,  (4.42a) 


LP 


f  dR{r,r')  ,^<9^/>(r')^  , 


<  Ci(2/p  -  €,p,p)  ||i?(r,  OIIlp  II^^IIloo  ||^||ii/i. 


LP 


n  =  2,  0  <  e  <  2,  i  =  1,  2, 


(4.42b) 


d'^R{r,r')  ,  , 

7-  J 


LP 


f  d‘^R{r,r') 
Iq  dridrj 


6K{r')fj{r')df 


LP 


<  ll^^llioo  WHwlp  (^Ci(min  ,P,p)S{n)  +  Ci{X,p,p)T{n)  +  . 

(4.42c) 
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Using  (4.42)  and  defining  U5  as  in  (4.40c),  we  get  (4.39b).  □ 

Using  Lemma  4.4,  we  state  and  prove  the  following  results  about  Born  expansion 
and  Born  approximation  when  both  the  absorption  and  reduced  scattering  coefficients 
are  perturbed. 


Theorem  f.B  The  integral  operator  IZ  has  an  inhnite  order  representation  with  the 
index  M  =  1  as  follows: 


(4.43) 


If 


C,\\6^Va\L  +  C,\\64^<l 


(4.44) 


holds,  then  exists  for  the  representation  given  in  (4.43)  and  the  m-th  order  Born 

approximation  converges  to  for  $  G  W^’^.  Furthermore,  the  error  between 

$  =  E"^^  and  E^~^^  is  given  as  follows: 


$  _  ^"1$ 


wrp 


<{C,\\SfVa\\^  +  C,\\Sn\ 


$ 


VCCp 


(4.45) 


Proof  From  (4.39),  we  derive 

ll^ll  ||<^ha|lLoo  +  ^5  •  (4.46) 

Hence  (4.43)  holds  and  (4.44)  is  the  sufficient  condition  for  E°°  to  exist  by  Proposition 
4.2.  (4.45)  holds  using  (4.46)  and  the  following  inequality: 

$  _  =  $(™)  =  7^™$. 

□ 


4-2.  Born  expansion  when  only  the  absorption  eoeffieient  is  perturbed 

In  this  section,  we  will  study  the  Born  expansion  and  the  Born  approximation  when 
=  0  and  6fia  7^  0.  When  Sk  =  0,  the  analysis  given  in  Theorem  4.5  holds. 
Furthermore,  we  can  say  more  about  the  Born  expansion.  Since  TZ  =  TZi,  we  do  not 
need  to  treat  the  second  or  hrst  derivative  of  Robin  function  as  the  kernel  of  the  integral 
operator  IZ. 

Before  analyzing  the  Born  expansion  in  the  normed  spaces  in  >  1, 

IU°j'(0g(n  =  2),  and  >  3)  for  ro  G  Q,  we  first  state  some  inequalities: 
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)  /  \og{2d/\r  —  r  \)dr  <  (log(2)  +  l)c<;2d,  n  =  2, 


ri  — , 


(4.47a) 

(4.47b) 


where 


Hi)  /  \og{2d/\r^  —  r'\)  \og{2d/\r^  —  r'\)dr'  <  CQUJ2d^,  n  =  2,  (4.47c) 

Jn 


/  _  ^2|a2-n^^/ 

Jn 

<  C7(oi,  02)cnnk^  —  if  Q;i  +  0:2  <  n, 

<  C8(oi,  02)c<;„  \og{2d/\r^  -  r'^l)  if  oi  +  02  =  n, 

<  C7(oi,  02)cc;„(i“^^“^~”  if  Oi  +  02  >  n 

[  |r^  -r'|“i-’^log(2d/|r'  -r^Ddr'  <  C'9(oi)o;,,d"b 


Ce  <  -(6(log2)^  +  21og21og3  +  logs  -  log2  -  1)  <  1, 

"  on— max(Q:i ,0:2)  i  i  ' 

^7(01,02)  =  2"-“^-“^  - +  — +  —  , 

71  —  (y.\  —  0(2  04\  0(2 


(4.47d) 

(4.47e) 

(4.47f) 

(4.47g) 


^8(01,02)  =  + 


1  1 
oi  log  2  02  log2j  ’ 


c,(„.)  =  Mi  ^ 

^  ^  oi2“i  n2“i  oi 

Proof  See  Appendix  B.  □ 

To  do  a  quantitative  analysis,  let  us  dehne  the  following  bounds  for  the  Robin 
function 


U{n) 

:=  sup||R(r,  •)||^;(,o«>  ,n  >  3 

r&n  '■-2 

(4.48a) 

U{n) 

:=  sup||R(r,  Olln/o,^,  ,n  =  2 
ren  ’"■'“s 

(4.48b) 

U  (n,  e) 

:=  sup  1  i?(r,  •)|Wo,oo  ,n>3,0<e<2. 
ren  ’"-2-'= 

(4.48c) 

With  the  aid  of  Lemma  4.6,  we  are  able  to  state  and  prove  the  following  inequalities  for 
the  integral  operator  77i. 


Lemma  4-L  We  have  the  following  norm  bounds  for  the  integral  operator  77i; 

i)  <  Ci(min  ,p,p)7/(n)  ||(5/ia||Loo  n  >  3,  (4.49a) 
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i)  <  C'i(^  -  e,p,p)t/(n)  ||(5/ia||ioo  n  =  2,0  <  e  <  2,  (4.49b) 


Hi)  ||7^i||^,x>^ioo  <  CwUJnU{n)\\5iXa\\L^ ,  (4.49c) 

'i'^)  -  <^7(2,  2)a;nt/(n)  ||(5/ia||oo  ^  =  3,5,6,  ••• ,  (4.49d) 

^  C'8(2,2)o;4?7(4)  ||(5/ia||^  n  =  4,  (4.49e) 

^  Ce  log  2u;2t^(2)  II 5/ia II 00  ^  =  2,  (4.49f) 

r,log  T'jlog 

where  the  constant  Ciq  is  given  by 

Cio  =  (log2  +  l)(i,  n  =  2  (4.50a) 

d^ 

Cio  =  —,  n>3  (4.50b) 


Proof  (4.49a)  and  (4.49c)  result  from  (3.18a)  and  (3.18b),  respectively.  (4.49c)  is 
derived  from  (4.47a)  for  two-dimension  and  (4.47b)  for  n-dimension  [n  >  3)  with  ai  =  2. 
(4.49d)  is  obtained  by  (4.47d)  and  (4.47f)  and  using  |ri  —  r2|  <  d  for  all  ri,r2  G  fl. 
Similarly,  (4.49e)  and  (4.49d)  are  derived  from  (4.47e)  and  (4.47g),  respectively.  □. 

By  using  Lemma  4.7,  we  give  the  following  theorem  about  Born  expansion  and 
Born  approximation  : 

Theorem  4.8  The  integral  operator  77  =  77i  has  the  following  inhnite  order 
representation  with  an  index  M  =  1  such  that 

LP  ^  LP  ^  LP  ^  PP  ^  ,  p>l,  (4.51a) 

^  ^  ^  ^  ,  (4.51b) 

^  ^  ,  n  >  3,  (4.51c) 

<irg  -  -  ■  ■  ■  n  =  2.  (4.51d) 

Given  the  normed  space  B  =  LP{p  >  1),  or  dehne  a  constant 

Gii  =  Gii(i7)  depending  on  the  normed  space  B  as  follows: 

Gi(min  (^“>2^  ,p,p)U{n)  li  B  =  PP  and  n  >  3, 

C,{l-e,p,p)U{2)  ifB  =  PP,n  =  2, 

CiQ(jJnU{n)  if  i?  =  L“, 

C7{2,2)unU{n)  iiB  =  W^)f^  and  n  =  3,  5,  6,  •  •  •, 

^§(2,  2)o;4G(4)  if  5  =  1^5“  and  n  =  4, 

Ge  log  2uj2U{2)  \i  B  =  and  n  =  2, 

where  0  <  e  <  2  and  r  &  Q.  If  the  following  condition 

ll'Spolloo  < 


(4.52) 
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holds,  then  exists  for  each  representation  given  in  (4.51)  and  the  m-th  order  Born 
approximation  converges  to  for  $  G  5.  Furthermore,  the  error  between 

$  =  E'^^  and  is  given  by: 

1$  <{Cn{B)\\6fia\Lr  5  •  (4.53) 

B  B 

The  proof  of  Theorem  4.8  is  obtained  by  Proposition  4.2  and  Lemma  4.7,  which  is 
similar  to  the  proof  of  Theorem  4.5.  In  Theorem  4.5  and  Theorem  4.8,  the  same  normed 
space  is  used  for  the  inhnite  order  representations  with  an  index  M  =  1.  However,  the 
following  theorem  is  another  kind  of  inhnite  order  representation  for  the  operator  77 
with  an  index  M  ^  1. 


Theorem  4.9  The  operator  77  has  the  following  inhnite  order  representations  with  index 
M  >2  and  Bm  =  where  is  the  normed  space  of  continuous  functions  having  the 
norm  IHI^oo: 


^0  _ ^  ^0  _ ^  ^0  _ ^  ^0  _ ^  ^0 


for  n  =  2,  3, 4,  •  •  • ,  (4.54a) 


—i.  L2(k-1) 


if  1  <  p  =  —  and  k  is  an  positive  integer, 
2k 


(4.54b) 


np 

IP  ^ 


L^v 


Tl  —  2j9  71 

for  I  =  [ — - ]  >0ifl<p7^  —  and  k  is  an  positive  integer,  (4.54c) 

ZiJ)  ZiK 


<’g 


for  n  =  2, 4, 6, 


fF°’“  o 

*  *  ro,n— 2 


p^0,oo 
*  ro,7i— 1 


f'olog 


CO  (jo 


(4.54d) 

for  n  =  3,  5,  7,  •  ■  ■  (4.54e) 


Let  the  normed  space  B  =  L“,  LP{p  >  1),  >  3),  hF°j“g(n  =  2).  If  the  condition 


- YfEE 

CioUnU  (n) 

holds,  then  77°°$  exists  for  each  representation  given  in  (4.54)  and  E"^^  converges 
to  77°°$  for  $  G  77.  The  bounds  for  the  error  between  the  (m  —  l)-th  order  Born 
approximation  and  $  =  77°°$  are  given  by: 


o 


$  _ 


np  <  Ci2(m,n,p)  (7/(n)  ||(5/ia|lo 

^  n  — 2mp 


if  m  < 


n  —  2p 


np  ^  n 
n  —  2mp  ~  2 


$-F— 1$  „<  C'i2(m,n,p)C'i3(e,n)(17(n)||<5/ia||^^)”^  $ 

L  e 


(4.56a) 


(4.56b) 
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IV) 


VI) 


Vll) 


Vlll) 


IX) 


X) 


<l)  _ 


< 


CO 


<l)  _ 


< 


CO 


$  _ 


w 


0,oo 


rQ,2m+2 


<|)  _ 


<|)  _ 


<|)  _ 


w, 


’'0 


< 


CO 


< 


CO 


<|)  _ 


< 


CO 


if  m  —  1  =  — :: — —  is  an  integer, 


2p 


(4.56c) 


C 


12 


n  —  2p 


,p,njC'i3C'2(^-)(C'ioo;„)™-^ 


n 


n  —  2p 


-1 


■  {U{n)  \\Sp 


a\\L° 


$ 


LP 


n  —  2p  n  —  2p 

it  — z -  IS  an  integer  and  m  > 


C'l2([ 


2p 

n  —  2p  N  /  „  N 


2p  ’ 


(4.56d) 


2p 


,p,n)(Cioa;n)™  ‘ 


{U{n)  \\5p 


all  L° 


$ 


n-2p  .  n-2p 

it  — ; -  IS  not  an  integer  and  m  > 


2p 

<  UZ,C7{2,2z){u;nU{n)\\6fia 


$ 


w 


2p 

0,oo  ’ 


if  2m  +  2  <  n, 

<  n-7iCr(2,2^)Q(2,n-2)(o;.t/(n)||<5/i,|L)' 


(4.56e) 


(4.56f) 


$ 


w 


0,oo  ’ 


if  2m  +  2  =  n, 

Ced 


((log2  +  l)duj2U{n)  \\6fj,a 


$ 


log  2  +  1 
if  n  =  2, 

n  —  2 

^7(2,  2*)C'8(2,  n  -  2)C,{2)d\Ci00Onr-^ 


w 


0,oo  ’ 


^0-2 

(4.56g) 


(4.56h) 


(unF(n)  \\5p,a 


$ 


w 


0,oo 


^’0’ 


if  n  is  even  and  2m  +  2  >  n  >  4, 

n—1 

C7(2,2z)d(Cioco^r-^  (c0nl/(n) 

if  n  is  odd  and  2m  +  2  >  n, 


(4.56i) 


$ 


w'J 


(4.56j) 


where  is  defined  in  (3.15);  Cq,  C7,  Cg,  and  Cg  are  defined  in  Lemma  4.6;  Cio  is  defined 
in  Lemma  4.7;  and  the  constants  C12  and  Ci^  are  given  by 

Ca(m,p,n)  =  n”iC,(2,^^) 

n  —  2tp 

:U{n,  e) 


Ci3(e,n)  =  Ci(e)d" 


U{n)  ■ 


Proof  From  (4.47a),  (4.47b),  (3.15),  and  using  L°°  C  for  all  p  >  1,  we  get  (4.56a) 
through  the  sequence  of  image  function  spaces  (4.54a).  From  (3.14),  (3.15),  and  (4.56b), 
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we  get  (4.56b)-(4.56e)  from  (4.54b)  and  (4.54c).  From  (4.47c)-(4.47g)  and  (4.56b),  we 
get  (4.56f)-(4.56j)  by  (4.54d)  and  (4.54e).  The  sufficient  condition  (4.55)  results  from 
(4.49c).  □ 


Thus,  by  Theorem  4.8  and  Theorem  4.9,  we  have  investigated  the  three  questions 
related  to  the  Born  expansion  and  the  Born  approximation  when  only  the  absorption 
coefficient  is  perturbed. 

We  now  focus  on  the  conditions  (4.52)  and  (4.55).  CQ,C^{2,2){n  = 

3,  5, 6,  •  •  •),  C8(2,  2)(n  =  4),  and  Cio  can  be  estimated  by 


C&  <  1, 

C^{2,2)  <  2”-^ 


-  3^-2 

n  —  4 


C'lo  <  1.7d,  n  =  2, 
d^ 

Cio  <  y ,  n  >  3. 


n  =  3,  5, 6,  7,  •  •  • , 
n  =  4, 


(4.57a) 

(4.57b) 

(4.57c) 

(4.57d) 

(4.57e) 


If  we  neglect  the  lower  order  term  R  —  H,  then  the  approximation  of  U (n)  is  as  follows: 

U{n)  ^  sup  |r  —  ro|"'“^|i7(r, ro)|  <  - -,n  >  3  (4.58) 

7/(n)  sup  |i7(r,  ro)|/log(|r  —  ro|/2(i)  <  ,n  =  2  (4.59) 

Using  (4.57)  and  (4.58),  the  conditions  (4.52)  and  (4.55)  for  the  m-th  order  Born 
approximation  to  converge  to  the  Born  expansion  can  be  changed  as  follows  : 


f^hallioo  <  YYd’’  n  =  2  ioi  (4.51b)  and  (4.54)  ,  (4.60a) 

(5/ia||^oo  <  — ^ ^  n  >  3  for  (4.51b)  and  (4.54)  ,  (4.60b) 

n  =  3,5,6,---,  for  (4.51c)  ,  (4.60c) 

n  =  4,  for  (4.51c)  ,  (4.60d) 

n  =  2,  for  (4.51d)  .  (4.60e) 

Note  that  all  conditions  in  (4.60)  depend  on  L,  which  is  the  lower  bound  of  k.  Given 
the  bound  of  Gi,  a  similar  analysis  can  be  obtained  for  the  representation  (4.51a)  of  the 
Born  expansion. 


d  fVd  1 1  j^oc  ^ 


log  2 
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In  this  section,  we  derive  the  Frechet  derivatives  of  the  coefficient-to-solution  and  the 
coefficient-to-measurement  operators.  We  use  the  relationship  between  the  Frechet 
derivatives  and  the  Born  expansion.  We  consider  the  cases  where  the  Born  expansion 
has  the  inhnite  order  representation  with  index  M  =  1  such  that 

B  ^  B  ^  B  ^  (5.61) 

where  B  =  when  both  the  diffusion  and  absorption  coefficients  are  perturbed 

(Theorem  4.5)  and  B  =  Lp,  or  IF°j“  when  only  the  absorption  coefficient  is 

perturbed  (Theorem  4.8). 

We  first  state  the  definition  of  the  Frechet  derivative  for  operators  defined  on  Banach 
spaces. 

Let  Bi  and  B2  be  Banach  spaces  and  BL{Bi,  B2)  be  the  Banach  space  of  bounded 
linear  operators  from  Bi  to  B2  with  the  norm 

1 1  (Dx  1 1 

||Q||=  sup  '^\QeBL{BuB2).  (5.62) 


Definition  5.1  [The  (first  order)  Freehet  derivative]  Let  P  :  S'  C  i?i  — i?2  be  an 
operator  from  a  open  set  S  of  Banach  space  Pi  into  a  Banach  space  P2.  Then,  P  is  called 
Frechet  differentiable  for  a:  G  S',  if  there  is  a  continuous  linear  operator  Q  :  Pi  — ^  P2 
such  that 


\\P{x  +  5x)-[P{x)  +  Q{5x)]\\^ 
hm  - — — - ^  =  0. 


||<5x|| 


||(5a;| 


Bi 


(5.63) 


The  linear  operator  Q  is  called  the  first  order  Frechet  derivative,  or  simply  the  Freechet 
derivative  of  P  and  denoted  by  P\x). 


Definition  5.2  [The  seeond  order  Freehet  derivative]  Let  P'{x)  be  the  Frechet  derivative 
of  P  :  P  C  Pi  ^  P2  for  each  xeS.  Then  P'  :  P  C  Pi  PL(Pi,  P2).  If  P'  is  Frechet 
differentiable  at  x,  we  denote  it  by 

P"(a;)  :  Pi  ^PL(Pi,P2),  (5.64) 

and  call  it  the  second  order  Frechet  derivative  or  Hessian  of  P  at  x. 


Definition  5.3  [The  m-th  order  Freehet  derivative]  Higher  order  Frechet  derivatives  are 
defined  recursively,  for  m  =  3,  •  ■  ■  by  P^'^\x)  :  Pi  — PL(P™“^,  P2)  such  that 

p(^_i)(^  +  5x)  -  [P^^-^fix)  +  P(™)(a;)(5a:]|| 
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We  can  also  view  as  a  mapping  from  Bf^,  the  Cartesian  product 

of  Bi  with  itself  m  times,  to  B2.  Let  us  denote  the  image  of  P^'^\x)  at  m- 
tuples  {5xi,5x2.,- ■  ■  ^dxm)  ^  -B™  by  P‘^'^\x)[5xi^5x2-,- "  and  P^'^\x;5x)  := 

p{m)  Sx,  -  ■  ■  ,  5x\ . 

5.1.  The  Freehet  derivatives  of  the  eoeffieient-to- solution  operator 

Let  us  dehne  the  coefhcient-to-solution  operator  ^  ■.  G  ^  B  such  that  =  <h,  where 
<h  G  -B  is  the  solution  of  (2.1)  and  x,  G,  B  are  given  by: 

G  =  L°°  xL°°,  B  =  when  a;  =  (5.66a) 

G  =  L°°,  B  =  when  x  =  k  and  p,a  is  hxed,  (5.66b) 

G  =  L“,  B  =  L”,  L“,  ,  or  , 

when  X  =  Ha  and  k,  is  hxed.  (5.66c) 

Let  X  be  given.  In  Section  4,  7^™  =  P"^{6x)  is  treated  as  a  bounded  linear  operator 
in  BL{B,  B)  mapping  $  G  i?  to  7^™((5a;)<h  G  B  with  a  given  5x.  However,  in  this  section, 
we  will  interpret  7^™  =  7?.™'(-)<h  as  an  operator  in  BL{G'^,  B)  from  ((5a;i,  •  •  • ,  5xm)  G 
with  a  given  $  such  that 

n^{5xi,  6x2,  •  •  • ,  6xm)^  =  n{6xi)n{6x2)  ■  ■  ■  n{6xm)^.  (5.67) 

Then,  by  a  similar  analysis  as  in  Theorem  4.5  and  Theorem  4.8,  we  can  show  that  the 
operator  77™'<h  is  bounded  for  the  operator  norm  from  G™  into  B  such  that 

\\n"^[Sxi,  ■  ■  ■  ,dXm)^\G^^B  ^  *^14  WHe-  (5-68) 

where  Gu  =  iaiax(G4,  G5),  G5,  and  Gii(i7)  depending  on  the  case  (5.66a),  (5.66b),  and 
(5.66c),  respectively,  by  Theorem  4.5  and  Theorem  4.8.  Note  that  in  the  Born  expansion, 
the  m-th  order  term  is  given  by  77™((5a;)<h. 

Theorem  S.f.  The  m-th  order  Frechet  derivatives  of  the  coefhcient-to-solution  operator 
4/  are  contained  in  BL{G'^,  B)  and  are  given  by 

m!77”^<h  when  x  =  {n,  Ha) 


Qm^, 

dxm 

Qm^, 


dHaF 


m—k 


(5.69a) 

(5.69b) 
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Proof  We  will  prove  (5.69)  first.  Assume  x  =  Inserting  m  =  2  in  (5.68),  we 

get 

||^(a;  +  6x)  -  (^(x)  +  n{6x)^)\\s  =  $  -  ($  +  n{6x)^) 

B 

=  7^2((5a;)$)  <  $  Cl^\\5x\\l ,  (5.70) 

B  B 

for  all  6x  &  G  and  $  G  5.  By  the  definition  of  the  Frechet  derivative  in  (5.63),  we  get 
H  =  77$.  In  this  proof,  we  use  Tl[x]  instead  of  77,  when  it  is  needed  to  stress  that  the 
Robin  function,  which  is  the  kernel  of  the  integral  operator  77,  depends  on  x. 

Next,  we  will  prove  that  =  m!77™$,m  >  2,  by  induction.  Let  us  assume 
for  an  integer  k  <m.  Then  by  the  dehnition  given  in  (5.65) 

+  Ok.i{\\Sx\y  (5.71) 

or  equivalently, 

77(^-^)[a;  +  (5a;]$  =  77(^-^)[a;]$  +  kn^'^\x]Sx<l>  +  Ofc_i(||(5a;||G),  (5.72) 

for  all  k  <  m.  Here  if  =  Ofc-idl^xHg')  means  that  hm||5a;||^^o  =  0.  Then 

using  (5.68)  and  (5.72) 

g  m-i  =  (^  ~  1)!77”^  ^[x  +  (5a;]$ 

=  {m  —  l)!(77[a;]  +  7l‘^[x]6x  +  0i{\\5x\\q))7V^~‘^[x  +  (5a;]$ 

=  {m  —  l)!(77[a;]  +  lZ^[x\5x  +  oi(||^a;||g)) 

•  (77”'“^[a;]$  +  (m  -  l)7V^~^[x\5x^  +  Om-2(||5a;||(^)) 

=  (m  —  l)!77"^“^[a;]$  +  m\7V^[x\5x^  +  Om(||5a;||(^) 

=  +  m\TV^[x\5x^  +  Orn(||(5a;||c).  (5.73) 

Using  the  dehnition  of  the  higher  order  derivatives  (5.65),  we  get  |^(a^)  =  m!77™'[a;]$. 
Hence,  by  induction  argument,  we  have  proved  (5.69a).  With  a  similar  argument  and 
noting  that  77i  is  independent  of  5k  and  that  772  is  independent  of  (5/ia,  we  prove  (5.69b). 
□ 

If  P  is  m-times  continuously  differentiable  on  S,  and  (x)  is  integrable  between 
any  two  points  in  S,  then  the  Taylor’s  theorem  holds:  For  any  x,x  +  5x  E  S,  we  have 

pii)(x) 

P(x  +  5x)  =  P(x) +  'S^ - 1 — -Sx"^  +  Em(x  +  5x,x]  P),  (5.74) 

i\ 

i=l 

where 

\M\r  u  f  ^ 

\\Emix  +  6x,x;P)\\^  <  - sup  \\P^"^\x  +  96x)\\  ,  (m)  .  .  (5.75) 

ml  06[O,1]  ,B2) 
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Although  the  statement  and  the  proof  are  similar  to  the  Taylor’s  theorem  in  Euclidean 
space,  we  must  consider  each  term  with  respect  to  the  operators  between  Banach 
spaces.  For  the  proof  of  (5.74),  see  [33].  If  another  operator  Q  is  m-times  differentiable, 
Em{.x  +  Sx,x;Q)  <  C  ||5a;||^^,  and  P{x)  =  Q{x),  then  we  can  show  that 

P^^{x)  =  Q^\x),i  =  1.  (5.76) 

Let  T”*  :=  P{x)  +  'YlT=i  be  the  m-th  order  Taylor  expansion.  Then  from 

Lemma  4.3  and  (5.76),  we  conclude  that  the  m-th  order  Born  approximation  is  the  same 
as  the  m-th  order  Taylor  expansion,  i.e.  T™  =  F™.  This  fact  can  be  used  as  another 
proof  of  Theorem  5.4. 


5.2.  The  Freehet  derivatives  of  eoeffieient-to-measurement  operator 


In  this  section,  we  compute  the  Frechet  derivatives  of  the  coefficient-to-measurement 
operator  T. 

Given  the  photon  density  function  <h,  which  is  the  solution  of  (2.1),  different  type 
of  boundary  data  can  be  measured.  Let  /  be  any  function  from  complex  space  C  to  C 
and  let  T  =  /(4/).  The  Frechet  derivatives  of  the  coefficient-to-measurement  operator 
T  can  be  computed  using  the  Frechet  derivatives  of  the  coefficient-to-solution  operator 
4/  by  using  change  of  variables  as  follows: 

r'  =  /'(T)T'  =  /'(T)77$ 

r"  =  /"(T)(T')^  +  /'(^)^"  =  /"(T)(77$)2  +  2/'(T)772$ 

and 

m 

j(^)M  ^  ^  ^  77(-)$),  m  >  3, 

i=l 

where  Am,i  is  a  polynomial  of  degree  m  and  Am,i{xi,  ■  ■  ■  ,Xm)  is  a  linear  combination  of 
monomials  with  =  "nr,  if  /  is  m  times  differentiable. 

Most  widely  used  functions  for  /  are 


f{x)  =  3^^(a;)  (5.77a) 

f{x)  =  3?(loga;)  (5.77b) 

for  some  complex  number  x.  (5.77a)  is  called  the  Born  measurement  and  (5.77b)  is 
called  Rytov  measurement. 

In  the  case  of  the  Born  measurements,  =  m!3?(77”^<h),  and  in  the  case  of  Rytov 
measurement,  the  first  and  second  order  Frechet  derivatives  are  given  by 


T' 


T" 


3? 


77$ 


-(77$)^ 


+ 


2772$  \ 


(5.78a) 
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$2 


(5.78b) 
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In  [21],  we  have  proved  the  local  convergence  of  a  method  which  we  call  Two-level 
Mnltiplicative  Space  Decomposition  Method  for  DOT  image  reconstruction.  In  the 
proof  of  the  convergence,  we  have  assumed  that  the  second  order  Frechet  derivative  of 
coefficient-to-measurement  operator  is  bounded,  when  Rytov  measurements  are  used. 
By  using  (5.70)  and  (5.78b),  the  second  order  Frechet  derivative  is  bounded  by 

||r"||G2^VKi-  <  (5.79) 


6.  Conclusion 

In  this  paper,  we  derived  the  Born  expansion  and  Frechet  derivatives  for  the  diffuse 
optical  tomography  for  arbitrary  domains  with  Robin  type  boundary  conditions.  To 
dehne  valid  Born  expansion,  we  introduced  sequences  of  appropriate  normed  spaces 
such  as  Lebesgue  spaces,  Sobolev  spaces,  and  weighted  Sobolev  spaces.  We  derived 
sufficient  conditions  on  the  perturbation  in  the  diffusion  and  absorption  coefficients  for 
the  convergence  of  the  Born  expansion  in  n  dimensions,  (n  >  2).  We  computed  bounds 
for  the  error  in  the  m-th  order  Born  approximation.  Next,  we  showed  that  the  m-th 
order  Frechet  derivatives  of  the  coefficient-to-solution  operator  is  equal  to  the  m\  times 
the  m-th  corresponding  term  in  the  Born  expansion. 

Although  we  only  consider  the  boundary  value  problem  (2.1)  with  Robin  boundary 
conditions,  the  analysis  introduced  in  this  paper  can  be  easily  extended  to  the  general 
second  order  elliptic  partial  differential  equations  with  other  boundary  conditions. 

Acknowledgments 

This  material  is  based  upon  work  supported  by  ONR-N00014-04-1-0694,  and  US  Army 
Medical  Research-W81XWII-04-l-0559.  Any  opinions,  hndings,  and  conclusions  or 
recommendations  expressed  in  this  material  are  those  of  the  authors  and  do  not 
necessarily  reflect  the  views  of  the  National  Science  Foundation.  Various  portions  of 
this  research  were  supported  by  the  Center  for  Subsurface  Sensing  and  Imaging  Systems, 
under  the  Engineering  Research  Centers  Program  of  the  National  Science  Foundation 
(Award  Number  EEC-9986821),  and  Rensselaer  Polytechnic  Institute. 

References 

[1]  R.  A.  Adams.  Sobolev  spaces.  Academic  Press  [A  subsidiary  of  Harcourt  Brace  Jovanovich, 

Publishers],  New  York-London,  1975.  Pure  and  Applied  Mathematics,  Vol.  65. 

[2]  S.  R.  Arridge.  Optical  tomography  in  medical  imaging.  Inverse  Problems,  15:R41-93,  1999. 

[3]  G.  Bao  and  P.  Li.  Inverse  medium  scattering  for  the  Helmholtz  equation  at  hxed  frequency. 

Inverse  Problems,  21(5):1621-1641,  2005. 

[4]  D.  R.  Bates.  Quantum  theory  I.  Elements.  Academic  Press,  1961. 


Born  expansion  and  Frechet  derivatives  in  diffuse  optieal  tomography 


29 


[5]  A.  P.  Calderon  and  A.  Zygmund.  On  the  existence  of  certain  singular  integrals.  Acta.  Math., 

88:85-139,  1952. 

[6]  B.  Chen  and  J.  J.  Stamnes.  Validity  of  diffraction  tomography  based  on  the  first  born  and  the 

first  rytov  approximations.  Applied  optics,  37:2996-3006,  1998. 

[7]  T.  Dierkes,  O.  Dorn,  F.  Natterer,  V.  Palamodov,  and  H.  Sielschott.  Frechet  derivatives  for  some 

bilinear  inverse  problems.  SIAM  journal  on  applied  mathematics,  62:2092-2113,  2002. 

[8]  G.  B.  Folland.  Introduction  to  partial  differential  equations.  Princeton  University  Press,  second 

edition,  1995. 

[9]  M.  Gell-Mann  and  M.  L.  Goldberger.  The  formal  theory  of  scattering.  Physical  Review,  91:398- 

408,  1953. 

[10]  D.  Gilbarg  and  N.  S.  Trudinger.  Elliptic  partial  differential  equations  of  second  order.  Springer- 

Verlag,  1983. 

[11]  F.  Hettlich.  Frechet  derivatives  in  inverse  obstacle  scattering.  Inverse  Problems,  ll(2):371-382, 

1995. 

[12]  F.  Hettlich  and  W.  Rundell.  The  determination  of  a  discontinuity  in  a  conductivity  from  a  single 

boundary  measurement.  Inverse  Problems,  14(l):67-82,  1998. 

[13]  B.  Hofmann.  A  denseness  theorem  with  an  application  to  a  two-dimensional  inverse  potential 

refraction  problem.  SIAM  J.  Math.  Anal,  30(4):896-911  (electronic),  1999. 

[14]  L.  Hormander.  The  analysis  of  linear  partial  differential  operators.  I.  Classics  in  Mathematics. 

Springer- Verlag,  Berlin,  2003.  Distribution  theory  and  Fourier  analysis.  Reprint  of  the  second 
(1990)  edition  [Springer,  Berlin;  MR1065993  (91m:35001a)]. 

[15]  L.  Hormander.  The  analysis  of  linear  partial  differential  operators.  II.  Classics  in  Mathematics. 

Springer- Verlag,  Berlin,  2005.  Differential  operators  with  constant  coefficients.  Reprint  of  the 
1983  original. 

[16]  A.  Kirsch.  The  domain  derivative  and  two  applications  in  inverse  scattering  theory.  Inverse 

Problems,  9(l):81-96,  1993. 

[17]  W.  Kohn.  The  validity  of  Born  expansions.  Physical  Review,  87:539-540,  1952. 

[18]  W.  Kohn.  On  the  convergence  of  Born  expansions.  Reviews  of  modern  physics,  26:292-310,  1954. 

[19]  Rainer  Kress.  On  the  numerical  solution  of  a  hypersingular  integral  equation  in  scattering  theory. 

J.  Comput.  Appl.  Math.,  61,  1995. 

[20]  A.  Kufner.  Weighted  Sobolev  Spaces.  Wiley,  1985.  Second  edition. 

[21]  K.  Kwon,  B.  Yazici,  and  M.  Guven.  Two-level  domain  decomposition  methods  for  diffuse  optical 

tomography,  submitted  to  Inverse  Problems. 

[22]  B.  A.  Lippmann  and  J.  Schwinger.  Variational  principles  for  scattering  processes  I.  Physical 

Review,  79:469-480,  1950. 

[23]  I.  Manning.  Error  and  convergence  bounds  for  the  Born  expansion.  Physical  Review,  139:B495- 

B500,  1965. 

[24]  V.  A.  Markel,  V.  Mital,  and  J.  C.  Schotland.  Inverse  problem  in  optical  diffusion  tomography  HI 

:  Inversion  formulas  and  singular  value  decomposition.  J.  Opt.  Soc.  Am.  A,  20:890-,  2003. 

[25]  V.  A.  Markel,  J.  A.  O’Sullivan,  and  J.  C.  Schotland.  Inverse  problem  in  optical  diffusion 

tomography.  IV.  nonlinear  inversion  formula.  J.  Opt.  Soc.  Am.  A,  20:903-912,  2003. 

[26]  V.  A.  Markel  and  J.  C.  Schotland.  Inverse  problem  in  optical  diffusion  tomography  I  :Fourier- 

Laplace  inversion  formulas.  J.  Opt.  Soc.  Am.  A,  18:1336-,  2001. 

[27]  V.  A.  Markel  and  J.  C.  Schotland.  Inverse  problem  in  optical  diffusion  tomography  H  :  Role  of 

boundary  conditions.  J.  Opt.  Soc.  Am.  A,  19:558-,  2002. 

[28]  C.  Miranda.  Partial  Differential  equations  of  elliptic  type.  Springer- Verlag,  1970. 

[29]  F.  Natterer.  An  error  bound  for  the  Born  approximation.  Inverse  Problems,  20(2):447-452,  2004. 

[30]  M.  S.  Pindzola  and  F.  J.  Robicheaux.  Time-dependent  born-series  calculations  of  three-body 


Born  expansion  and  Frechet  derivatives  in  diffuse  optieal  tomography 


30 


scattering  systems.  Physical  Review  A,  64. 

[31]  R.  Potthast.  Frechet  differentiability  of  boundary  integral  operators  in  inverse  acoustic  scattering. 

Inverse  Problems,  10(2):431-447,  1994. 

[32]  H.  Rabitz.  Born  series  for  potential  scattering.  Physical  Review  A,  5:620-627,  1972. 

[33]  L.  B.  Rail.  Computational  Solution  of  Nonlinear  Operator  Equations.  John  Wiley  &  Sons,  1969. 

[34]  T.  H.  Rihan.  Modified  distorted-wave  Born  series.  Physical  Review,  164:1247-1252,  1967. 

[35]  L.  I.  Schiff.  Quantum  mechanics.  McGraw-Hill,  1968. 

[36]  S.  L.  Sobolev.  Applications  of  functional  analysis  in  mathematical  physics.  Translated  from 

the  Russian  by  F.  E.  Browder.  Translations  of  Mathematical  Monographs,  Vol.  7.  American 
Mathematical  Society,  Providence,  R.I.,  1963. 

[37]  V.  I.  Tatarski.  Wave  propagation  in  a  turbulent  medium.  McGraw-Hill,  1961. 

[38]  J.  G.  Ye,  K.  J.  Webb,  R.  P.  Millane,  and  T.  J.  Downar.  Modified  distorted  born  iterative  method 

with  an  approximate  frechet  derivative  for  optical  diffusion  tomography.  J.  Opt.  Soc.  Am.  A, 
16:1814-1826,  1999. 


Born  expansion  and  Frechet  derivatives  in  diffuse  optieal  tomography 

Appendix  A.  Proof  of  Lemma  3.3 


31 


Using  C  all  0  <  e  <  a,  L(r,  ■)  G  lU°2-e  inserting  q  =  pto  (3.13),  we 

get  (3.18a)  with  Ui(niin  —  e,  2  —  ej  ,p,p).  Using  G  14^°,’“  and  inserting  q  =  p 

we  get  (3.19a)  and  (3.18c)  with  the  constant  Ui(niin  ij  ,p,p). 

We  start  to  prove  (3.19b)  and  (3.18d)  by  dehning 

dH{r,  r') 


Vi,&{r)  :  = 


Jn\B{r,5) 

Taking  the  derivative  of  Vi^s,  we  get 


drj 


dvj^s 

drj 


[r  = 


f  d‘^H{r,r') 
lQ\B{r,5)  dr-idrj 


-%lj{r')dr' . 


f!{r')dr'  — 


dH{r,  r') 


ldB{r,5) 


drj 


(A.l) 


fj{r')iyjdS{r'){A.2a) 


=  ipir) 


JdB{r,5) 

/  [ 

ldB{r,S)  L 


d‘^H{r,  r') 


ln\B{r,5)  dr-idr-j 


ijj[r)dr' 


dH{r,r')  ,  dH{r',r) 


drj 


drj 


z/|d^(r'), 


(A.2b) 

(A.2c) 


where  and  Uj  are  the  j-th  component  of  the  outer  normal  vector  with  respect  to  dfl 
and  dB{r,S),  respectively.  Let  us  assume  tf  G  C^'^.  Using  G  W^f 

the  integral  in  (A. 2c)  is  bounded  by 


0,oo 

1+A 


< 


>dB{r,5)  L 

i 

'dB{r,S) 

+ 


dH{r,F)  ,  dH{r',r) 

V-M 

d{H{r,  r')  +  H{r' ,  r)) 


v^.dS{r') 


drj 


f){r'] 


dS{r') 


f 

dH{r' ,  r) 

JdB(r,S) 

drj 

{r')  —  'f{r)  \  dS{r') 


5^ 

<  y^nll^llco.A 


\K\ 


CO.^ 


nL2 


nL 


(A.3) 


Thus,  by  (A.3),  the  integral  in  (A. 2c)  goes  to  zero  as  5  goes  to  zero  for  fj  G  And 
the  first  integral  in  (A. 2b)  is  Letting  5  go  to  zero,  we  get 


d^v 


drjdrj 


ir  = 


d\L-H){r,r'] 


dvj^s , 


drjdrj 


ip{r')dr  +  lim  (r) 


d\L-H){r,  r') 


drjdrj 


i/j{r')dr'  + 


(5^0  drj 

d^H{r,F)  fj{r 

-%p[r  )dr  — 


drjdrj 


nnir) 


(A.4) 


The  second  integral  of  the  righthand  side  of  (A.4)  is  bounded  in  the  sense  of  (3.16). 
Thus,  we  proved  (3.19b).  We  prove  (3.18d)  using  (3.13),  (3.16),  and  (A.4). 


d^v 


drjdrj 


<  Ci{\,p,p)  sup 

l^p  r'eo 


d\L-H){r',-) 


drjdrj 
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■  ^  II  V^llLP 


(A. 5a) 


Thus,  we  proved  (3.18d)  for  tjj  G  Extension  of  (3.18d)  when  f  E  can  be  found 
in  [5,  28], 

(3.19c)  follows  from  (3.19a)  and  (3.19b).  To  prove  (3.20),  using  Stokes  theorem  in 
n  \  B{r,  (5),  we  get 

[  [v{r')Mr'L{r,  r')  —  L{r,  r')Mr'v{r')]  dr' 


'n\B(r,S) 

=  [  [v{r')J\fr/L{r,r')  —  L{r,r')J\fr/v{r')]dS{r') 


'an 


'aB(r,5) 


n(r')«(r')^^^^^(r')  -  L{r,r')K{r')^{r') 


dS{r').  (A.6) 


Since  v  E  v  E  W'^  by  trace  formula,  each  term  of  the  second  integral 

of  the  righthand  side  of  (A.6)  has  the  following  asymptotic  behavior: 


'aB{r,5) 


dv 

L{r,r')K{r')^{r')dS{r') 


—  ^  I  ^1  (A. 7a) 


^  o  ^  -  H){r,r')  ,, 

v[r  )n[r  ) - - - (r  )d5{r  ^ 


'aB(r,S) 

<U\\v\ 


W^’P 


du 

d{L-H){r, 


dr'- 


w. 


0,oo 


(A.  7b) 


r,l  +  A 


fdB{r,5) 


I  /N  /  i.dH{r,r')  ,  , 

v[r  )K[r  ) - - - (r  )aS(r  ) 


'aB(r,l) 


\r'  —  r\ 


,  .  r  -  r  \  (  r'  -r 

V  0— - r  db 


|r'  —  r| 


(A.7c) 


(A. 7a)  and  (A. 7b)  go  to  0  as  5  goes  to  0  and  (A. 7c)  goes  to  v{r)  as  5  goes  to  0  by  the 
mean  value  theorem.  Thus  letting  <5  — 0  and  combining  (A.6)  and  (A. 7),  we  get  (3.20) 
interpreting  the  hrst  term  in  (A.6)  as  in  the  sense  of  (3.16).  □ 


Appendix  B.  Proof  of  Lemma  4.6 

(4.47a)  and  (4.47b)  are  obtained  easily  using  spherical  coordinates  with  respect  to  r^. 
Let  us  divide  hi  into  three  regions  depending  on  two  points  and  r^: 


hlj-i  =  {A  G  h2|  |r'  —  I  < 
Vtr'2  =  {r  E  r2|  |r'  —  r^|  < 
flc  =  {A  G  r2|  |r'  —  I  > 


I  ^  1  _  I 


I  ^  1  _ 


rjrt  J- 


-,  \r'  —  r^l  > 


I  1  _  I 
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\og{2d/\r^  —  r'\)  \og{2d/\r‘^  —  r'\)dr' 


\og{2d/\r^  —  r'\)  \og{2d/\r‘^  —  r'\)dr' 


+  /  log(2(i/|r  —  r  I)  log(2(i/|r  —  r  Ddr 
Jn  2 

+  [  log(2(i/|r^  —  r'l)  log(2(i/|r^  —  r'Ddr'. 


(B.l) 


Next,  consider  the  hrst  term  in  the  righthand  side  of  (B.l).  If  r'  G  then 

I  1  2 1 

f  ^  2^  .  Using  this  result  and  by  change  of  variables  with  respect  to  the 

spherical  coordinates  centered  at  r^,  we  get 


\og{2d/\r^  —  r'\)  \og{2d/\r'  —  r‘^\)dr' 


<  U2  log(4(i/|r^  —  r^l) 


p\og{2d/p)dp 


Likewise, 


<  log{4d/\r^  —  r^|)|r^  —  r^p[2  log{4d/\r^  —  r^|)  +  1]. 
16 


(B.2) 


\og{2d/\r^  —  r'\)  \og{2d/\r'  —  r‘^\)dr' 


<  log(4(i/|r^  —  r^|)|r^  —  r^|^[2  log(4(i/|r^  —  r^|)  +  1]. 

16 

I  /  2 1 

1  I  T I  T  T  oinrr  -hlnio  o  m VvTr  r\^  troT'io  m, 


(B.3) 


If  r'  G  flc,  then  |r^  —r'\>  h  Using  this  fact  and  by  change  of  variables  with  respect 
to  spherical  coordinates  centered  at  r^,  we  get 


\og{2d/\r^  —  r'\)  \og{2d/\r'  —  r^Ddr' 


<  /  log(6(i/|r' —  r"^!)  log(2(i/|r' —  r^Ddr' 

J  Vie 

j-d 

<U2  {\og3\og{2d/ p)  +  {\og{2d/p)f)pdp 

Jo 

<U2d^  ^(log2)2  +  ^(log3  -  l)(21og2  + 1)  . 

Inserting  (B.2),(B.3),(B.4)  into  (B.l),  we  get  (4.47c). 
Returning  to  (4.47b),(4.47d),  and  (4.47e),  we  get 


(B.4) 
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1^1  _  ^/|ai-n|^/  _  ^2\a2-n^^l 


+  /  |r^ 


(B.5) 


where  each  term  in  the  righthand  side  of  (B.5)  is  bounded  by 
[  \r^  —  r'f^~'^\r  —  r‘^\°'‘^~^dr 


^  _  ^‘2ia2-n 


|r^— "^dr 


r\F-F\/2 

<  2”-“2|ri  -  VWdp 

Jo 

_  1^1  _  .^2  |ai+02-r!. 


(B.6) 


Likewise 


1^1  _  ^l\ai-n\^/  _  ^2|02-n^^'  1^,1  _  ^2\ai+a2-n 


(B.7) 


Suppose  that  ai  +  a2  <  n,  then 


|r^  —  r^|“^  ^\r' —  r‘^f'^  '^dr 


^  gn-ai  /  1^/  _  ^2\ai+a2-2n^^/ 


„Qi+02-2n  n-1 


r|^l-r2|/2 


p”  Undp 


^  gn-cii 271-01-02 1.^1  _  ^2|ai+02-n  ^77 _ 

n  —  tti  -|-  C(2 

Inserting  (B.6),  (B.7),  (B.8)  into  (B.5)  and  considering  the  symmetry  of  and  for 
(B.8),  we  get  (4.47d).  (4.47e)  and  (4.47f)  is  derived  by  modifying  the  integral  area 
for  the  third  integral  in  (B.8)  into  Jj^i_^2|  ■  and  ■,  respectively.  Finally,  (4.47g)  is 
computed  in  a  similar  way  as  follows: 


log(2(i/|r' —  r^Ddr'  <  /  |r^— r'|“^  log(2(i/|r' —  r^DIdr' 


+  /  |r  —  r'|“^  ”1  log(2(i/|r' —  r^DIdr' +  /  |r^  — r'|"^  log(2(i/|r' —  r^DIdr' 

V  2  ^ 


<  Un  log(4(i/|r^  -  r^l)- 


-.1  _  ,y.2  \ai 


q;i2“i 


I  1  _  2  I  CKp 


log(4(i/|r^  —  r^l)  H — 


P«i  /  1  \ 

+ujn —  (  log(6)  H - I  <  iL>nd°‘^Cj{ai). 
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Abstract. 

In  Diffuse  Optical  Tomography  (DOT),  the  discretization  error  in  the  numerical 
solutions  of  the  forward  and  inverse  problems  results  in  error  in  the  reconstructed 
optical  images.  In  practice,  this  error  can  not  be  avoided,  but  can  be  reduced  by 
appropriate  discretization.  In  this  work,  we  analyze  the  error  in  the  reconstructed 
optical  absorption  images,  resulting  from  the  discretization  of  the  forward  and  inverse 
problems.  We  show  that  the  error  due  to  the  discretization  of  each  problem  is 
bounded  by  the  discretization  error  in  the  corresponding  problem  solution,  scaled 
spatially  by  the  solutions  of  both  problems.  Based  on  this  error  analysis,  we  develop 
adaptive  discretization  schemes  for  the  forward  and  inverse  problems  in  order  to  reduce 
the  error  in  the  reconstructed  images.  This  approach  leads  to  adaptively  refined 
composite  meshes  that  yield  the  desired  level  of  imaging  accuracy  while  minimizing 
the  computational  complexity. 


1.  Introduction 

Imaging  in  Diffuse  Optical  Tomography  (DOT)  is  comprised  of  two  interdependent 
stages  which  seek  solutions  to  the  forward  and  inverse  problems.  The  forward 
problem  is  associated  with  describing  the  NIR  light  propagation,  while  the  objective 
of  the  inverse  problem  is  to  estimate  the  unknown  optical  parameters  from  boundary 
measurements  [3]. 

There  are  a  variety  of  factors  that  affect  the  accuracy  of  the  DOT  imaging,  such 
as  model  mismatch  (due  to  light  propagation  model  and/or  linearization  of  the  inverse 
problem),  measurement  noise,  discretization,  numerical  algorithm  efficiency,  and  inverse 
problem  formulation.  In  this  work,  we  present  an  error  analysis  to  show  the  effect  of  the 
discretization  error  on  the  accuracy  of  the  reconstructed  optical  absorption  images.  Our 
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analysis  identifies  several  factors  which  inflnence  the  extent  to  which  the  discretization 
error  impacts  the  accuracy  of  the  reconstructed  images.  For  example;  the  mutual 
dependence  of  forward  and  inverse  problems,  the  number  of  sources  and  detectors 
and  their  conhguration,  and  the  formulation  of  the  inverse  problem.  In  particular, 
our  error  analysis  reveals  that  one  has  to  consider  the  inherent  dependence  of  the 
inverse  problem  solution  on  the  solution  of  the  forward  problem  in  order  to  develop 
an  effective  discretization  scheme  for  each  problem.  In  this  context,  based  on  the 
presented  error  analysis,  we  propose  adaptive  discretization  schemes  for  the  forward 
and  inverse  problems  which  ensure  stable  image  reconstructions  and  reduce  the  error  in 
the  reconstructed  optical  images  resulting  from  discretization. 

There  has  been  extensive  research  on  error  estimation  and  adaptive  mesh 
rehnement,  however  most  of  the  efforts  have  concentrated  on  the  estimation  of 
discretization  error  in  the  solutions  of  partial  differential  equations  (PDFs)  [6,  2, 
7,  8,  26,  28].  On  the  contrary,  relatively  little  has  been  published  in  the  area  of 
parameter  estimation  problems  governed  by  PDFs;  see  for  example  [24,  10,  23,  9]  for 
a  posteriori  error  estimates  and/or  adaptive  mesh  rehnement  strategies  in  parameter 
estimation  problems.  In  the  area  of  DOT,  Arridge  et  al  showed  that  approximation 
errors  resulting  from  the  discretization  of  the  forward  problem  can  lead  to  signihcant 
degradation  in  the  quality  of  the  reconstructed  images  [4] .  In  that  work,  the  error  in  the 
reconstructed  images  is  minimized  using  an  enhanced  imaging  model  that  treats  this 
additional  approximation  error  within  the  Bayesian  framework.  To  address  the  optical 
image  degradation  due  to  discretization,  several  investigators  have  reported  on  adaptive 
discretization  schemes  for  the  forward  and  inverse  problems,  yet  an  analysis  regarding 
the  error  in  the  reconstructed  optical  images,  resulting  from  discretization  has  not  been 
reported  so  far.  In  [12],  Fppstein  et  al  proposed  a  “data  driven  zonation”  scheme, 
which  can  be  viewed  as  an  adaptive  discretization  algorithm,  for  fluorescence  imaging. 
We  presented  a  region-of-interest  (ROI)  imaging  scheme  for  DOT  [15],  which  employed 
a  multi-level  algorithm  on  a  nonuniform  grid.  The  non-uniform  grid  is  designed  so  as 
to  provide  hner  spatial  resolution  for  the  ROI  which  corresponds  to  the  tumor  region 
as  indicated  by  a  priori  anatomical  image.  Torregrossa  et  al  proposed  an  a  priori  non- 
uniform  mesh  design  which  provides  high  resolution  at  the  heterogeneities  and  near 
boundary  regions  [27].  In  that  work,  the  mesh  rehnement  is  independent  of  the  source- 
detector  conhguration  and  the  location  of  the  heterogeneities.  Gu  et  al  proposed  a  dual 
mesh  strategy,  in  which,  a  relatively  hne  uniform  mesh  is  considered  for  the  forward 
problem  discretization  and  a  coarse  uniform  mesh  is  generated  for  the  inverse  problem 
discretization  [14].  In  the  same  study,  an  adaptive  rehnement  scheme  was  proposed  for 
the  inverse  problem  discretization,  but  no  adaptive  rehnement  was  considered  for  the 
solution  of  the  forward  problem.  Huang  and  Zhu  also  presented  a  dual-mesh  strategy 
which  makes  use  of  a  priori  ultrasound  information  [16].  In  that  work,  the  dual  mesh  is 
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a  coarse  mesh  for  the  background  tissue  and  a  relatively  hue  mesh  for  the  heterogeneity, 
similar  to  the  approach  in  [15] .  In  fluorescence  imaging,  Joshi  et  al  used  a  dual  adaptive 
mesh  strategy  to  discretize  the  inverse  problem  and  the  associated  coupled  diffusion 
equations,  where  the  rehnement  criterion  is  based  on  a  posteriori  discretization  error 
estimates  [19].  Note  that  in  all  these  studies  [12,  15,  27,  14,  16,  19],  the  mesh  rehnement 
criteria  considered  for  the  inverse  (forward)  problem  disregard  the  impact  of  the  solution 
of  the  forward  (inverse)  problem. 

In  this  work,  we  model  the  forward  problem  by  the  frequency-domain  diffusion 
equation.  As  for  the  inverse  problem,  we  focus  on  the  estimation  of  the  absorption 
coefficient.  We  consider  the  linear  integral  equation  resulting  from  the  iterative 
linearization  of  the  inverse  problem  based  on  Born  approximation  and  use  zeroth  order 
Tikhonov  regularization  to  address  the  ill-posedness  of  the  resulting  integral  equation. 
We  use  hnite  elements  with  piecewise  linear  Lagrange  basis  functions  to  discretize  the 
forward  and  inverse  problems  and  analyze  the  effect  of  the  discretization  of  each  problem 
on  the  reconstructed  optical  absorption  image.  Our  analysis  describes  the  dependence 
of  the  image  quality  on  the  optical  image  properties,  the  conhguration  of  the  source 
and  detectors,  and  the  regularization  parameter,  in  addition  to  the  discretization  error 
in  the  solution  of  each  problem.  In  this  respect,  we  separately  show  the  effect  of  the 
discretization  of  the  forward  and  inverse  problems.  Hence,  we  hrst  consider  the  impact 
of  the  inverse  problem  discretization  when  there  is  no  discretization  error  in  the  solution 
of  the  forward  problem,  and  provide  a  bound  for  the  resnlting  error  in  the  reconstructed 
optical  image.  Next,  we  analyze  the  effect  of  the  forward  problem  discretization  on 
the  accuracy  of  the  reconstructed  image  without  discretizing  the  inverse  problem,  and 
obtain  another  bonnd  for  the  resulting  error  in  the  reconstructed  optical  image.  We  see 
that  each  error  bound  comprises  the  discretization  error  in  the  corresponding  problem 
solution,  scaled  spatially  by  the  solntions  of  both  problems.  This  is  a  direct  conseqnence 
of  the  fact  that  the  inverse  problem  solution  depends  on  the  model  dehned  by  the 
forward  problem.  Finally,  based  on  the  two  error  bounds  provided  by  the  error  analysis, 
we  propose  a  practical  adaptive  discretization  scheme  for  the  forward  and  inverse 
problems,  respectively.  We  remark  that  the  mesh  rehnement  criterion  for  each  problem 
comprises  the  discretization  error  in  the  corresponding  problem  solution,  scaled  spatially 
by  the  solutions  of  both  problems.  Thus,  the  proposed  adaptive  mesh  generation 
algorithms  address  the  interdependence  between  the  forward  and  inverse  problems 
and  the  sonrce-detector  conhguration  as  well.  This  makes  the  proposed  adaptive 
discretization  algorithms  diherent  from  the  previous  approaches  [12,  15,  27,  14,  16,  19]. 
The  simulation  experiments  validate  the  implications  of  onr  error  analysis  and  show 
that  the  proposed  mesh  generation  algorithms  signihcantly  improve  the  accuracy  of  the 
reconstructed  optical  images  while  keeping  the  nnmber  of  nnknowns  in  the  forward  and 
inverse  problems  within  the  allowable  limits. 
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The  outline  of  this  paper  is  as  follows:  Section  2  dehnes  the  forward  and  inverse 
problems.  In  Section  3,  we  discuss  the  discretization  of  the  forward  and  inverse  problems. 
In  Section  4,  we  present  two  theorems  that  summarize  the  impact  of  discretization  on  the 
accuracy  of  the  reconstructed  optical  images.  In  Section  5,  we  introduce  the  adaptive 
mesh  generation  algorithms  for  the  solution  of  the  forward  and  inverse  problems.  In 
Section  6,  we  present  our  experimental  results,  which  is  followed  by  the  Conclusion 
section.  The  Appendix  includes  results  regarding  the  boundedness  and  compactness 
of  the  forward  operator  and  the  proof  for  the  convergence  of  the  inverse  problem 
discretization. 

2.  Forward  and  Inverse  Problems 

In  this  section,  we  describe  the  model  for  NIR  light  propagation  and  dehne  the  forward 
and  inverse  DOT  problems.  Table  1  provides  a  list  of  the  notation  used  throughout  the 
paper. 

2.1.  Forward  Problem 

We  use  the  following  boundary  value  problem  to  model  the  NIR  light  propagation  in  a 
bounded  domain  D  C 

-V  ■  D{x)Vgj  +  {p,a{x) +  =  Qj{x)  x  e  Q,  (2.1) 

So  ■ 

gj  +  2aD{x)-^  =  0  x  e  dQ,  (2.2) 

where  Qj{x)  is  the  point  source  located  at  xf  D{x)  is  the  diffusion  coefficient,  fj,a{x) 
is  the  absorption  coefficient,  u  is  the  modulation  frequency  of  the  source,  c  is  the  speed 
of  the  light  and  a  =  {1  +  R)/{1  —  R)  where  i?  is  a  parameter  governing  the  internal 
reflection  at  the  boundary  dQ.  Note  that  we  assume  the  diffusion  coefficient  is  isotropic. 
For  the  general  anisotropic  material,  see  [20]. 

The  adjoint  problem  associated  with  (2.1)-(2.2)  is  given  by  the  following  boundary 
value  problem  [3]: 

-V  ■  D{x)Vg:  +  ifiaix)  -  -)g*  =  0  G  D,  (2.3) 

c 

g*  +  2aD{x)^  =  Q*{x)  x  e  <9D,  (2.4) 

where  Q*(x)  is  the  adjoint  source  located  at  xf  In  this  work,  we  approximate  the 
point  source  Qj  in  (2.1)  and  the  adjoint  source  Q*  in  (2.4)  by  Gaussian  functions  with 
sufficiently  low  variance,  whose  centers  are  located  at  x{  and  x^^,  respectively. 
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Table  1.  Index  of  notation. 


Notation 

Explanation 

g{x,x{)  =  gj{x)  =  Qj 

Solution  of  the  diffusion  equation  for  the 
point  source  located  at  a;  = 

9*{x,x^fs  =  g*{x)  =g* 

Solution  of  the  adjoint  problem  for  the 
adjoint  source  located  at  a;  =  a?^ 

G^{x)  = 

Finite  element  approximation  of  gj{x) 

Gfix)  =  g; 

Finite  element  approximation  of  g*{x) 

ej{x)  =  ej 

The  discretization  error  in  the  hnite  element 
approximation  of  g{x,x{) 

=  e* 

The  discretization  error  in  the  hnite  element 
approximation  of  g*{x,xf) 

a{x)  =  a 

Small  perturbation  over  the  background  fj,a{x) 

=  T{fj)  =  Tij 

Differential  measurement  at  the  detector 

due  to  the  source 

Aa:  ^ 

The  integral  operator  mapping  a{x)  to  Tij 

A*  :  ^ 

The  adjoint  of  Aa 

H{xf  xf,  x) 

The  kernel  of  Aa 

H*{x;xi,xf} 

The  kernel  of  A* 

7 

A 

The  regularization  parameter 

a^{x)  = 

Solution  of  the  regularized  inverse  problem 

fi^) 

The  complex  conjugate  of  f{x) 

ll/(^)llo 

The  (or  H^)  norm  of  f{x)  over  the  domain  D 

ll/(^)llp 

The  norm  of  f{x)  over  the  domain  D 

ll/(^)l|oo 

The  norm  of  f{x)  over  the  domain  D 

\\f{x)\\Lp{n) 

The  norm  of  f{x)  over  the  domain  D 

\\fix)\\o,m 

The  (or  H^)  norm  of  f{x)  over  the 

hnite  element 

ll/(*)llp,m 

The  norm  of  f{x)  over  the  hnite  element  Qm 

\\f{x)\\oo,m 

The  norm  of  f{x)  over  the  hnite  element  Qm 

For  the  solution  of  the  forward  problem,  we  consider  the  variational  problem 
formulations  of  (2.1)-(2.2)  and  (2.3)-(2.4): 

[wfj-DVgj  +  fj{{fia  +  —)9j-Qj)+[  =  (2.5) 

Jn  c  Jqq  za 

where  fj  G  is  a  test  function.  Equivalently,  we  can  express  (2.5)  using  the  sesquilinear 
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form: 


where 


■=  i^9j  >= 


(2.6) 


dj)  :=  Vfj-  DVgj  +  (/ia  +  —)i^9j, 


i'fiQj)  ■  I 
Jn 

Similarly,  the  variational  problem  for  (2.3)-(2.4)  can  be  formulated  as: 


b*{ij,g:)  :=  A{ij,g:)+  <  fj,  ^g*  >=<  ^Q*  >,  (2.7) 

where  in  A{f!,g*),  u  is  replaced  by  —u. 

The  sesquilinear  forms  bifj^gf),  b*{f!,g*)  are  continuous  and  positive  dehnite  [17]. 
As  a  result,  the  variational  problems  (2.6)  and  (2.7)  have  unique  solutions,  which 
follows  from  the  Lax-Milgram  Lemma  [11].  Furthermore,  the  solutions  gj  and  g*  of 
the  variational  problems  (2.6)  and  (2.7)  belong  to  which  results  from  the 
boundedness  of  the  Gaussian  function  that  approximates  the  point  source  Qj  and  the 
adjoint  source  Q*  [17].  Assuming  D{x),  paix)  G  and  noting  that  Qj^Q*  G 

the  solutions  gj.,gl  satisfy  gj^g*  G  (Chapter  6.3,  Theorem  2  in  [13]).  This  last 

condition  implies  (Chapter  5.6,  Theorem  6  in  [13]) 


g„g:eC\n). 


(2.8) 


2.2.  Inverse  Problem 


In  this  work,  we  focus  on  the  estimation  of  the  absorption  coefficient;  therefore  we 
assume  D{x)  is  known.  To  address  the  nonlinear  nature  of  the  inverse  DOT  problem,  we 
consider  an  iterative  algorithm  based  on  repetitive  linearization  of  the  inverse  problem 
using  hrst  order  Born  approximation  [3].  As  a  result,  at  each  linearization  step,  the 
following  linear  integral  equation  relates  the  differential  optical  measurements  to  a  small 
perturbation  a{x)  on  the  absorption  coefficient  Pa{x): 


^{Xd:Xi)=  -  /  g*{x,xf)g{x,xi)a{x)dx 


=  /  H{xfxf,x)a{x)dx 
Jn 


(2.9) 


(2.10) 


where  x]i,  i  =  1, ...  ,Nd  and  xfj  =  1, ...  ,Ns  are  respectively  the  detector  and  source 
positions,  H{xi,xf,x)  =  —g*{x,xf)g{x,xi)  is  the  kernel  of  the  integral  operator 
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Aa  :  g{x,xi)  is  the  solution  of  (2.6),  g*{x,xf)  is  the  solution  of  (2.7), 

and  r(a;)j,  xi)  represents  the  differential  measurement  at  the  detector  due  to  the 
source.  Note  that  approximating  Q*  in  (2.4)  by  a  Gaussian  function  centered  at  x^ 
implies  that  T{xli,xi)  corresponds  to  the  scattered  optical  held  evaluated  at  x^j^,  after 
hltering  it  by  that  Gaussian  function.  Thus,  the  Gaussian  approximation  of  the  adjoint 
source  models  the  hnite  size  of  the  detectors.  Similarly,  approximating  Qj  in  (2.1)  by  a 
Gaussian  function  models  the  hnite  beam  of  the  source. 

Let  gj  and  g*  in  (2.9)  be  the  solutions  of  (2.6)  and  (2.7),  respectively.  Then, 
the  linear  operator  Aa  :  dehned  by  (2.9)  is  compact  and  bounded  by 

(see  Appendix  A  and  Appendix  B) 


<  A'dA^smax||5(*||omax||5(j||o.  (2.11) 

*  j 

For  the  given  solution  space  for  a,  the  compactness  of  the  linear  operator  Aa  implies 
the  ill-posedness  of  (2.9).  Hence,  we  regularize  (2.9)  with  a  zeroth  order  Tikhonov 
regularization.  This  yields  the  following  equation  which  dehnes  our  inverse  problem  at 
each  linearization  step: 


7  =  -4:^  =  (KAa  +  A/)a"  (2.12) 

=  (2.13) 

where  A  >  0  and  is  an  approximation  to  a.  In  this  representation.  A*  : 
is  the  adjoint  of  Aa  and  /  is  the  identity  operator. 

Let  A  =  Al^Aai  then  A  :  —a  is  dehned  as  follows: 


Na,Ns 


{Aa){x)=  H*{x-xfx\)  /  H{xfx\-x)a{x)dx 


=  /  K{x,x)a{x)dx, 

Jq 

where  k{x,  x)  stands  for  the  kernel  of  the  integral  operator  A  and  is  given  by 


(2.14) 


Nd,Ns 

k{x,x)=  Y  H*{x;xi,xf}H{xi,xf,x), 


(2,15) 


and  H*{x]xi,xf)  is  the  kernel  of  the  adjoint  operator  A*  given  by: 


Nd,N, 


{KP){x)  =  >  ,  H*{x-xfx\)(3{x\,xi)  = 


Nd,Ns 

E 


-g*{x,  x\)g{x,  xi)l3{x\,  x^),  (2.16) 


for  all 

Having  dehned  the  adjoint  integral  operator,  we  note  that  the  operator  A  = 
A*Aa  :  L°^  —A  is  compact  and  that  the  operator  K  :  —a  is  bounded  by 

IIA'II  <  ||Aa|p  +  A.  In  this  work,  we  assume  that  the  solution  G  also  satishes 
G  .  For  the  rest  of  the  paper,  we  will  denote  L°^  and  by  X  and  H,  respectively. 
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3.  Discretization  of  the  Inverse  and  Forward  Problems 


In  this  section,  we  outline  the  discretization  of  the  inverse  and  forward  problems. 

3.1.  Inverse  Problem  Discretization 

In  practice,  we  seek  a  hnite-dimensional  approximation  to  the  solution  of  the  inverse 
problem  (2.13)  at  each  linearization  step.  Therefore,  we  discretize  (2.13)  by  projecting 
it  onto  a  hnite  dimensional  subspace. 

Let  Xn  G  X  and  Yn  E  Y  denote  a  sequence  of  hnite-dimensional  subspaces  of 
dimension  n,  spanned  by  piecewise  linear  Lagrange  basis  functions  {Li, . . .  ,L„},  and 
{xp},  p  =  1, . . .  ,n,  he  the  set  of  collocation  points  on  fl.  Then,  the  projection  operator 
Pn  '.  Y  ^  Yn  associated  with  the  collocation  method  is  dehned  by 

n 

Pnf{x)  =  '^f{xp)Lp{x),  xeQ,  (3.17) 

p=i 

for  all  f{x)  G  Y. 

The  collocation  method  approximates  the  solution  of  (2.13)  by  an  element  G  X„ 
satisfying 

{Ka^){xi)  =-f{xi),  l  =  l,...,n.  (3.18) 

Using  the  piecewise  linear  Lagrange  basis  functions  Lp{x)  for  we  express  as 

n 

=  ^apLp{x).  (3.19) 

p=i 

Then,  projection  by  collocation  method  for  the  operator  K  yields 

n  « 

Xai  +  2^(^p  i^{^i,d!)Lp{x)dx  = 'j{xi),  l  =  l,...,n,  (3.20) 

p=i 

which  is  equivalent  to 

PnKa^  =  Pni.  (3.21) 

3.2.  Forward  Problem  Discretization 

In  this  section,  we  consider  the  hnite  element  discretization  of  (2.6)  and  (2.7),  and  use 
their  solutions  to  approximate  H^xfxh'jx)  and  H* [x]  x{,  x'^f) .  As  a  result,  we  obtain 
hnite  dimensional  approximations  to  K  and  7. 

Let  Lk{x)  be  the  piecewise  linear  Lagrange  basis  functions.  Replacing  and 
Qj  in  (2.6)  with  their  hnite-dimensional  counterparts  T  =  Ylk=iPkLk{x),  Gj  = 
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YlkU^kLk{xf  and  replacing  fj  and  g*  in  (2.7)  with  d'  =  G*  = 

'f2k=i'^kLk{x)  yields  the  matrix  equations: 

Sc,  =  q^.,  (3.22) 

S*d,  =  qf  (3.23) 

for  Cj  =  [ci,  C2,  •  •  • ,  CNj]'^  and  d*  =  [di,d2,  -  ■  ■ ,  Here  S  and  S*  are  the  hnite 

element  matrices  and  and  q*  are  the  load  vectors  resulting  from  the  hnite  element 
discretization  of  (2.6)  and  (2.7).  Note  that  for  each  source  (detector),  the  dimension  of 
the  hnite  element  solution  Gj  {G*)  can  be  diherent,  therefore  Nj  (iVj)  may  vary. 

The  boundedness  of  the  solutions  gj  and  g*  implies  that  the  discretization  error 
Cj  and  e*  in  Gj  and  G*  on  the  hnite  element  is  bounded.  A  bound  for  e,-  and  e* 
can  be  found  by  using  the  interpolation  error  estimates  (Theorem  4.4.4  in  [11]): 

||e,||o,m  <  G\\gj\\i^mhi„  (3.24) 

||e*||o,„^<C'||(7;||i,,„hi„,  (3.25) 

where  C*  is  a  positive  constant  and  {hff)  is  the  diameter  of  the  smallest  ball  containing 
the  naf^  hnite  element. 

3.3.  Discretization  of  the  Inverse  Problem  with  Operator  Approximations 

Substituting  the  hnite  element  approximations  Gj  and  G*  in  (2.14)  and  (2.16),  and 
using  the  resulting  hnite-dimensional  operator  approximations  in  (3.21),  we  obtain  the 
following  linear  system  in  terms  of  which  approximates  a^: 

Pnka^^  =  Pnf.  (3.26) 

In  (3.26),  the  operator  K  ■.  X  is  the  hnite  dimensional  approximation  of  K  in  (2.13) 
and  PnK  :  Y^.  Similarly, 

7  =  ^:r,  (3.27) 

where  A*  is  the  approximation  to  the  adjoint  operator  A*,  obtained  by  substituting  Gj 
and  G*  in  (2.16). 


4.  Discretization-based  Error  Analysis 

As  a  result  of  the  discretization  of  the  forward  and  inverse  problems,  the  reconstructed 
image  in  (3.26)  is  an  approximation  to  the  actual  image  a^.  Thus,  the  accuracy 
of  the  reconstructed  image  depends  on  the  error  incurred  by  the  discretization  of  the 
forward  and  inverse  problems. 

In  this  section,  we  analyze  the  ehect  of  the  discretization  of  the  forward  and  inverse 
problems  on  the  accuracy  of  DOT  imaging.  The  analysis  is  carried  out  based  on  the 
inverse  problem  at  each  linearization  dehned  by  (2.13)  and  the  associated  kernel  k{x,  x). 
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In  this  work,  we  follow  an  approach  which  allows  us  to  separately  analyze  the 
effect  of  the  discretization  of  each  problem  on  the  accuracy  of  the  reconstructed  optical 
image.  In  this  respect,  we  hrst  consider  the  impact  of  projection  (i.e.  inverse  problem 
discretization)  by  collocation  method  when  the  associated  kernel  k{x,x)  in  (2.13)  is 
exact.  Next,  we  explore  the  case  in  which  the  kernel  is  replaced  by  its  hnite-dimensional 
approximation  (i.e.  degenerate  kernel)  and  analyze  the  effect  of  the  forward  problem 
discretization  on  the  accuracy  of  the  reconstructed  image  without  projecting  (2.13). 

Our  analysis  reveals  that  even  the  kernel  is  exact,  the  accuracy  of  the  solution 
approximation  in  (3.21)  resulting  from  the  inverse  problem  discretization  depends 
on  the  kernel  k{x,  x)  of  the  integral  operator.  Likewise,  the  error  in  the  reconstructed 
optical  image  due  to  the  discretization  of  the  forward  problem  is  a  function  of  the 
inverse  problem  solution.  These  results  suggest  that  the  discretization  of  the  inverse 
and  forward  problems  can  not  be  considered  independent  of  each  other. 

f.l.  Case  1:  The  kernel  k,{x,x)  is  exact 

In  this  section,  we  show  the  effect  of  projection  on  the  optical  imaging  accuracy.  In  the 
analysis,  we  assume  that  the  kernel  k{x,  x)  is  exact.  We  hrst  prove  the  convergence  of 
the  projection  method  for  the  operator  K,  and  then  analyze  the  effect  of  projection  on 
the  imaging  accuracy. 

Clearly,  the  inverse  operator  K~^  :  Y  ^  X  exists  since  K  is  positive  dehnite  for 
A  >  0.  Furthermore,  by  the  compactness  of  A  and  Riesz  Theorem,  the  inverse  operator 
K~^  is  bounded  by 

\\K-^\\  <  (4.28) 

A 

Lemma:  Projection  by  collocation  method  for  the  operator  K  ■.  X  ^  Y  converges. 
Specihcally,  the  operators  PnK  :  W  are  invertible  and  {PnK)~^PnKa^  — >  a^, 

n  — >  cx).  Furthermore, 

\\{PnK)-^PnK\\x^y„  <  (4.29) 

for  some  Cm  >  0  independent  of  n. 

Proof.  See  Appendix  C.  □ 

Based  on  the  Lemma,  the  following  theorem  provides  an  upper  bound  for  ||q;^  — 
•anlUpo),  where  is  the  solution  of  (3.21): 

Theorem  1:  Let  {flm}  denote  a  set  of  hnite  elements  for  m  =  1,  •  •  •  ,Nx,  such  that 
Um^  ^rn  =  C,  and  hm  be  the  diameter  of  the  smallest  ball  that  contains  the 
element.  Then, 

—  'a^llLl(O)  <  C^/VQ\\I  —  TnWy^Xn 

m=l 
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^  Na  Na,Ns 

E  E  ll9.*»llo,».l|o^lli,™A».. 

A  _  .  . 

m=l 

where  C*  is  a  positive  constant,  is  the  volume  of  fl  and  Y  ^  is  a 
uniformly  bounded  operator  given  by  =  [/  + 

Proof. 

a^-a^  =  [I-{PnK)-^PnK]a^ 

=  [I-  {PnK)-^PnK]{a^  -  fj)  (4.30) 

since  [/—  {PnK)~^ PnK]f!  =  0,  where  fj  G  is  the  interpolant  of  a^.  Using  (C.3), 

[/  -  {PnK)-^PnK]  =/-[/+  jPnA]-^jP^K 


=  I-T^-K, 


(4.31) 


where  T„  =  [/  +  yPn^]  ^Pn  is  a  uniformly  bounded  operator  (see  Appendix  C). 
We  use  K  dehned  by  (2.13)  and  (4.31)  in  (4.30)  to  obtain 


a 


A  ^  A 


T 

-*■  tl. 


af,  =  {I-  Tn){a^  -fj)-  ^A(a"  -  fj). 


Then  we  use  the  dehnition  of  A  in  (4.32)  and  hnd 


=  {I  —  Tn){a'^  —  f))  —  ^  I  K{x.,x){a^  —  ip){x)dx. 


T 

-*■  n 


A 


(4.32) 


(4.33) 


This  leads  to 

,A  „  A 


+  ^||T„||y^x„||  f  n{x,x){a^ -f:){x)dx\\Li(n) 


<  VVn\\I-Tn 


Y^xJa 


+ -\\Tn\\Y^x„  dx  \K{x,x){a^  -  fj){x)\dx, 
"A  Jn  Jn 

The  second  term  in  (4.34)  can  then  be  rewritten  as: 

[  dx  [  \K{x,x){a^  -fjfdx 
A  Jo  Jo 


(4.34) 


\k{x,  x){a^  —  ipfdx 


(4.35) 


Let  Cq,  =  be  the  interpolation  error.  Then,  using  (2.15), 

Na 
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Na  r- 


E/  IE  9*iix)9jix)g*{x)gj{x)eaix)\dx 


(4.37) 


ij 


-  \9iix)9jix)\  /  \g*{x)gj{x)\\eaix)\da 


m=l  iJ 

Na 


-J2Y1  \9*iix)9jix)\\\g*{x)gj{x) 


(4.38) 


m=l  iJ 


where  (4.38)  follows  from  the  Schwarz’  inequality.  Note  that  g*gj  G  since 
gfg,eL^hY{2.8). 

We  now  use  (4.35)  and  (4.38)  to  obtain 


j\\T^\\y^x„  j  dx(^  ^^\n{x,x){a^  -  fjfdx  ) 


<  -||T„||y^x„  J^dx 

Na  Na,Ns 

X  lf/*(a?)f/i(a?)llkr(^)f/i(^)llo,m||e„(i)||o,m.  (4.39) 

m=l  i,j 

Using  the  bound  (4.39)  in  (4.34)  and  substituting  the  interpolation  error  bound  [11] 
||Ca||o,m  E  C*||cr  (4.40) 

we  obtain 


~  C^nlUbf^)  —  —  TnlW^Xr,  ^  \\oi^\\l,mhn 


Na  N^,Ns 


+  y  ll^n||y^x„  ^  ^2  \\9i9j\\Li(n)\\9i9j\\o,m\\a^\\i,mK 


m=l  iJ 


<  C^\\I  -  Tn\\Y^x„  Y1 


Na  Nii,Ng 


+  -\\Tn\\Y^x„max\\g*gj\\L^n) 


(4.41) 


m=l  iJ 


Remarks:  (i)  Theorem  1  shows  the  spatial  dependence  of  the  inverse  problem 
discretization  on  the  forward  problem  solution. 

(ii)  The  hrst  term  in  (4.41)  suggests  that  the  mesh  of  the  inverse  problem  be  rehned 
where  ||q;'’'||i  is  large. 
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(iii)  The  second  term  in  (4.41)  shows  that  the  term  ||Q;^||i,m  is  scaled  spatially  by 

Thus,  the  effect  of  the  interpolation  error  Ca  in  the  inverse  problem 
solution  is  scaled  by  the  solution  of  the  forward  problem. 

(iv)  The  regularization  parameter  affects  the  bound  on  ||q;^  — 

(v)  Increasing  the  number  of  sources  and  detectors  will  increase  the  bound  on 
l|a^-anlUho)- 


4.2.  Case  2:  The  kernel  is  degenerate 


In  this  section,  we  first  derive  approximate  upper  bounds  for  the  approximation  errors 
||/C  — K||  and  II7— 7II,  which  result  from  the  discretization  of  the  forward  problem.  Then, 
we  show  the  effect  of  these  approximation  errors  on  the  accuracy  of  the  reconstructed 
optical  image.  For  notational  convenience,  we  will  drop  the  subscripts  on  the  norms  ||  ■  || 
where  necessary. 

The  operator  K  :  X  ^  Y  is  bounded  with  a  bounded  inverse  K~^  :  Y  ^  X. 
By  the  finite  element  approximation  of  the  associated  kernel,  the  sequence  of  bounded 
linear  finite-dimensional  operators  K  is  norm  convergent  \\K  —  K\\  — >  >  cx), 

for  j  =  1,  •  •  • ,  Ns  and  i  =  1,  •  •  • ,  and 

<  lA,  (4.42) 

which  can  be  obtained  analogous  to  (4.28). 

In  the  following,  we  derive  an  explicit  approximation  to  the  error  —  K\\  in 
terms  of  the  associated  kernel  and  the  discretization  error  in  the  kernel  approximation. 
The  result  is  then  used  to  compute  the  error  in  the  reconstructed  optical  image  due  to 
\\k -K\\. 

By  definition. 


II  (A 


NdXs 

Aa)0i\\il  =  ^ 


fn(ffHx)gj(x)  -  G*(x)Gj(x))a(x)dx 


(4.43) 


where  Gf  Gj  are  finite  element  approximations  to  g*  and  gj,  respectively.  We  can 
expand  g*gj  —  G*Gj  as 

9i9j  —  G*Gj  =  e*ej  +  Gje*  +  G*ej,  (4.44) 

where  e*  =  g*  —  G*  and  =  gj  —  Gj.  Replacing  G*  and  Gj  respectively  with  g*  —  e* 
and  gj  —  Cj,  we  get 

k9j  -  Q  A  =  9j^  +  ^ej  -  ffcj 

-93<+¥Xj,  (4.45) 

where  we  neglect  the  term  e*ej. 

We  can  express  K  —  K  as 

k 


K  = 


(4.46) 
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Following  a  similar  approach  as  above, 

=  (a:  -  aixa,  -  aa + +  (a:  -  i:)4. 

As  a  result,  the  following  condition  holds: 

II A-  -  A'li  <  ||(^;  -  a:)(a,  -  ijii  +  \\a:(a^  -  ij  +  (yi;  -  A-jA^. 
Since  Aa  =  —{Aa  —  Aa)  +  Aa,  (4.48)  can  be  rewritten  as 
l|A'  -  A-||  =  ||.4;yi„  -  i;i.|| 

<  IK-'i:  -  -4:)(4.  -  i„)||  +  2||4;(4„  -  ijll 
«2||4:(4.-4„)||, 

where  we  neglect  the  term  ||(A*  —  A*A{Aa  —  Aa)\\. 

Similarly,  ||7  —  7||  can  be  interpreted  as 

||7-7l|Li(f2)  =  /  I  ^3i{x)gj{x)  -  G*{x)Gj{x))T{xfxl)\dx 

.  Nd,Ns 


(4.47) 

(4.48) 


(4.49) 


(^i(x)9j{x)  +  9*{x)ej{x))T{i,j)\dx, 


(4.50) 


where  the  error  in  F  due  to  discretization  is  neglected  and  the  last  approximation  is 
derived  similar  to  (4.45). 

We  now  analyze  the  effect  of  the  forward  problem  discretization  on  the  accuracy  of 
the  reconstructed  optical  image.  Let  be  the  solution  of 

Ka^  =  7,  (4-51) 

where  K  and  7  are  the  hnite  dimensional  approximations  to  K  and  7,  respectively. 
Then,  by  Theorem  10.1  in  [21],  the  error  in  the  solution  with  respect  to  the  actual 
solution  is  bounded  by 

ll«^  -  a^ll  <  ^  {  \\{K-  K)aA\  +  ||7  -  bll  }  •  (4-52) 

In  the  next  theorem,  we  will  expand  the  terms  in  (4.52)  to  show  explicitly  the  effect 
of  the  forward  problem  discretization  on  the  accuracy  of  the  inverse  problem  solution. 

Theorem  2:  Let  and  hA  be  the  diameter  of  the  smallest  ball  that  contains  the 

element  in  the  hnite  element  solutions  G*  and  Gj,  respectively.  Then,  a  bound  for 
the  error  in  the  solution  due  to  operator  approximation  K  is  given: 

G 


\a 


d  lUi(o)  <  -^max\\g*gj\\Li{n) 

A 


X 


Va  Nd,N, 

EZ 

\m=l  ij 
Wa  Nd,N^ 

E  E  ( 

m=l  ij 


||o,m  “f*  ||Q^||oo||fi'j||o,m 
“1“  11*^  II 00  ||o,m 


|0,m 


UlkmK, 


(4.53) 
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where  C*  is  a  positive  constant. 

Proof.  Using  (4.49),  (4.43),  and  (4.45),  we  can  write 

\\{K  -  K)a^\\mn)  ~  2\\A:(A^  -  ia)a"|Ui(n) 


s  n 

2il  9*i{x)gj{x)  /  (  gj{x)e*{x)  +  g*{x)ej{x)  \  a^{x)dx\\Li^^) 

Nd,Ns  „  _  _ 

2max\\g*gj\\Li(^n)  ^  jj(^  gj{x)e*{x)  +  g*{x)ej{x)  )  a^{x)\dx. 


(4.54) 


An  upper  bound  for  the  integral  in  (4.54)  can  be  obtained  as  follows: 


gj{x)e*{x)  +  g*{x)ej{x)  )a  {x)\dx 


<  X]  lie* ||o,m||fi'itt^||o,m  +  II  II  0,m  llfi'i  ei  ||o,m- 


(4.55) 


Note  that  gja^  G  since  \gja^\  <  1(7^1  ||q;^||oo-  Similarly,  g*a^  G  since 
Ifi-itt^l  <  |5'*|||a^||oo-  Using  (4.55)  in  (4.54), 

||(i^-;^)a"|Ui(n) 

Na  Na,N^ 

<  2max||c/;c/j||ii(t^)  EE  ll^j  llOjmllfi'jCl  ||o,m4“  II  ej  II  0,m  llfi'i  Cl  ||o,m- 

.... 


m=l  ij 


(4.56) 


To  compute  an  upper  bound  for  ||7  —  7||,  we  hrst  write 


.  Na,Ns  _  _ 

<max|r(bj)|  /  V  \e*{x)gj{x)  +  g*{x)ej{x)\dx 

Na  N^,Ns 

<  max|r(b  j)| 

m=l  i,j 

Noting  (2.9), 

max|r(bj)|  <  max||5(;5(j||ii(Q)||a||oo, 

which  leads  to 

Na  Nd.Ns  ^ 

max  |r(i,  j)  I  ^  ^  (  11^^  ||o,m||fi'j||o,m  +  llfi'i  ||o,m||^j  ||o,m 

m=l  i,j 

^  II  11^,11  f  X^Na  ^Na,Ns  |I  II  I 


(4.57) 


(4.58) 


<  max||c/*^j||ii(j^)||a||oo  (  E^ti  Ei,/’  "  lie* ||o,m||c/il|o,m  +  ||c/r||o,m||ej||o,m  )•  (4.59) 
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We  now  use  (4.56),  (4.59),  the  corresponding  discretization  error  estimates  (3.24)- 

(3.25),  and  (4.52)  to  obtain  (4.53).  □ 

Remarks:  (i)  Theorem  2  suggests  the  use  of  meshes  designed  individually  for  the 
solutions  Gj,  j  =  1,  •  •  • ,  Nj  and  G*,  i  =  1,  •  •  • ,  W- 

(ii)  Theorem  2  states  explicitly  the  effect  of  the  forward  problem  discretization 
on  the  accuracy  of  the  inverse  problem  solution.  In  this  context.  Theorem 
2  suggests  a  discretization  scheme  for  the  forward  problem,  where  the 
discretization  criterion  is  based  on  the  inverse  problem  solution  accuracy,  rather 
than  the  accuracy  of  the  forward  problem  solution. 

(iii)  For  each  source,  when  solving  for  Gj,  hf  has  to  be  kept  small  where 
(2||5(*a^||o,m  +  ||a||oo||fl'r||o,m)||5'illi,m  is  large.  Note  that  \\gj\\i,m  will  be  large  on 
the  elements  close  to  the  source. 

(iv)  For  each  detector,  when  solving  for  G*,  has  to  be  kept  small  where 

(2||5(ja^||o,m  +  ||a||oo||fi'ji|o,m)||fi'*||i,m  is  large.  Note  that  will  be  large  on 

the  elements  close  to  the  detector. 

(v)  l^fjl  and  \g*\  are  higher  close  to  the  sources  and  detectors,  respectively. 
Therefore,  has  to  be  small  around  the  source  and  around  all  detectors, 
where  is  nonzero.  Likewise,  has  to  be  small  around  the  detector  and 
around  all  sources,  where  is  nonzero. 

(vi)  If  is  nonzero  on  the  whole  domain  fl,  then  the  error  may  become  higher 
depending  on  the  magnitude  of  \gj\  and  \g*\. 

(vii)  The  regularization  parameter  affects  the  bound  on  ||q;^  — 

(viii)  Increasing  the  number  of  sources  and  detectors  increases  the  bound  on 

II®'*'  ~  ®'*'||l1(0)- 

4-3.  Iterative  Born  Approximation 

In  this  section,  we  explore  the  error  in  the  inverse  problem  solution  within  an  iterative 
linearization  approach. 

The  error  analysis  presented  in  this  paper  covers  the  error  which  results  from  the 
discretization  of  the  forward  and  inverse  problems.  If  a  is  sufficiently  low,  then  one 
iteration  suffices  to  solve  the  inverse  problem  and  the  error  analysis  discussed  above 
applies.  When  iterative  linearization  is  considered  to  address  the  nonlinearity  of  the 
inverse  problem,  we  can  make  use  of  the  error  analysis  at  each  linearized  step  as  follows: 
At  the  end  of  the  (r  —  1)*^  linearization  step,  the  absorption  coefficient  estimate  is  given 
by  fa~^\x)  =  pi'a\x)  +  ®n(p’  where  oA(t)  error  due  to  discretization  with 

respect  to  the  actual  solution  In  the  next  linearization,  an  error  on  the  new  solution 
update  jla\x)  will  be  introduced  due  to: 

(i)  projection. 
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(ii)  the  error  {K  —  in  the  operator  {KY^  and  the  error  (7  —  7)^'’  in  (7)('’ 

resulting  from  forward  problem  discretization,  and 

(hi)  the  errors  in  and  (d)^-!)  clue  to  error  in  Gj  and  G*  resulting  from  the 

^(r— 1)/  \ 
error  m  f  a  [x). 

As  a  result,  the  error  in  fYf\x)  at  the  iteration  is  bounded  by: 

r  r 

\\t^a{x)  -  fYf\x)\\  =  ““n(t)ll>  (4-60) 

t=i  t=i 

assuming  that  the  background  absorption  plY\x)  is  approximated  accurately. 


5.  Adaptive  mesh  generation 

In  this  section,  we  hrst  discuss  the  adaptive  mesh  design  for  the  discretization  of  the 
forward  and  inverse  problems  based  on  Theorems  1  and  2,  such  that  the  resulting  error 
in  the  reconstructed  optical  image  due  to  each  discretization  is  below  a  preset  bound. 
Next,  we  present  practical  adaptive  mesh  generation  algorithms  for  the  forward  and 
inverse  problems,  respectively. 


5.1.  Adaptive  Mesh  Generation  for  the  Forward  Problem 


Let  the  mesh  parameter  h^  for  gj  for  j  =  1,  -  ■  ■  ,Ns  and  the  mesh  parameter  hf^  for  g* 
ioT  i  =  1,  ■  ■  ■ ,  Nd  he  chosen  so  that: 


hi  < 


m  — 


Yli  +  ||Qi||oo||fi'j*||o,m)||5'j||l,» 

_ _ 


hi  < 


—  Bj, 


=  Bl 


(5.61) 

(5.62) 


where  the  tolerance  Cf  will  be  dehned  later.  Then,  by  Theorem  2,  the  error  in  the 
reconstructed  image  due  to  the  forward  problem  discretization  is  bounded  by: 


G 

—  max 
A  i,j 


\\9Kx)gj{x)\\Li(Q)NA{Nd  +  Nfcf 


(5.63) 


where  C*  is  a  positive  constant  and  e-^  is  the  total  allowable  error  in  the  reconstructed 
optical  image  due  to  the  forward  problem  discretization.  Equation  (5.63)  implies  the 
following  value  for  e/: 


\lf  1 

G{Nd  +  Ns)Na  raaxij  \\g*{x)gj{x)\\Li(n) ' 


(5.64) 


In  practice,  Bj  and  B*  in  (5.61)-(5.62)  can  not  be  computed  since  a,  a^,  gj,  and 
g*  are  unknown.  However,  Bj  and  B*  can  be  estimated  by  using  approximations  for 
the  functions  involved  in  these  bounds,  based  on  either  a  priori  information  or  on  the 
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recent  forward  and  inverse  problem  solution  updates.  Then,  the  elements  whose  mesh 
parameter  {h\f)  exceeds  Bj  (B*)  can  be  determined  and  rehned  to  satisfy  (5.61)- 
(5.62). 

Algorithm  1  outlines  the  adaptive  mesh  generation  algorithm  for  the  forward 
problem  in  the  form  of  a  pseudocode.  The  algorithm  is  performed  for  each  source 
and  detector  before  the  linearization  of  the  inverse  problem  and  it  yields  a  family  of 
adaptively  rehned  meshes  with  conforming  elements.  We  use  Rivara’s  algorithm  [25]  for 
rehnement. 


Algorithm  1  The  pseudocode  for  the  mesh  generation  algorithm  for  the  forward 
problem,  prior  to  the  linearization  of  the  inverse  problem, 
o  Generate  an  initial  uniform  mesh  (A,iVA),  A  = 
o  Set  e-^ 

o  Initialize  the  set  of  marked  elements:  Mg  ^ 
o  flag  =  True 

while  hag  =  True 

for  each  element  A^  €  A  with  mesh  parameter  {h\f) 
if  hrst  linearization 

.  Use  analytical  solutions  for  gj  and  g*  and  a  priori  anatomical 
information  about  a  to  compute  the  bound  Bj  in  (5.61)  {B*  in  (5.62)) 

else 

.  Use  current  solution  updates  Gj  and  G*  and 
to  compute  Bj  in  (5.61)  {B*  in  (5.62)) 

end 

if  hL  >  Bj  {hi^  >  B*) 

.  Me  ^  Me  U{Am} 
end 
end 

if  Me  {} 

.  Rehne  the  marked  elements  and  update  the  mesh  A 

.  Mg  ^ 

else 

.  hag  =  False 

end 

end 

o  Solve  for  Gj  (G*). 
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In  practice,  after  the  first  sweep  of  refinement,  one  can  compnte  the  bonnd  Bj 
and  B*  only  for  the  new  elements.  We  note  that  for  the  initial  mesh  design,  we  use  a 
model  problem  to  compute  the  terms  in  the  error  bound  relevant  to  the  forward  problem 
solution  (see  Appendix  D).  If  there  is  no  a  priori  information,  can  be  assumed  to 
be  spatially  constant  at  the  hrst  linearization  step.  After  the  hrst  linearization,  the 
norms  in  Bj  and  B*  relevant  to  gj  and  g*  are  not  expected  to  change  signihcantly.  In 
this  context,  the  terms  in  (5.61)  and  (5.62)  can  be  bounded  by 

||fi'illo,m||tt'*'||oo,m  and  11(7*110, m||'a'*'||oo,m,  respectively.  Therefore,  one  can  store  the  norms 
||5(j||o,m  and  ||(7*||o,m  at  the  end  of  the  hrst  mesh  generation,  and  update  Bj  and  B*  in 
the  following  mesh  generations  by  using  these  stored  values  and  the  updated  ||(a'^||oo,m 
values. 

In  case  e/  (e)-)  can  not  be  chosen  in  prior,  we  consider  a  posterior  approach,  set 
Cf  =  1  (ej  =  1),  and  compute  hf^/Bj  {h\^/ B*)  on  each  element,  which  is  used  as 
the  indicator  for  rehnement.  Then,  the  elements  with  indicator  value  exceeding  the 
average  hf^/Bj  {h\^l B*)  quantity  are  marked  for  rehnement.  We  note  that  in  this  case, 
the  algorithm  has  to  be  stopped  when  the  number  of  nodes  in  the  mesh  exceeds  the 
allowable  number  of  nodes. 

5.2.  Adaptive  mesh  generation  for  the  inverse  problem: 

Let  the  mesh  parameter  hm  for  the  solution  of  the  inverse  problem  be  dehned  as  follows: 

hm  <  (-inv!  —  T„||y^X„||Q;'^||l,m  +  ^||^n||v^X„ 

xmayi\\g*{x)gj{x)\\LH^a)  Y]  Ik* (a?)fi'i(a^)l|o,m||a^||i,m  =  (5.65) 

/ 

/ 

Then,  by  Theorem  1,  the  error  in  the  reconstructed  image  due  to  inverse  problem 
discretization  is  bounded  by 

CN^Cin,  =  (5.66) 

where  C*  is  a  positive  constant  and  is  the  total  allowable  error  in  the  reconstructed 
optical  image  due  to  inverse  problem  discretization. 

We  present  the  pseudocode  for  our  adaptive  mesh  generation  algorithm  used  at 
each  linearization  of  the  inverse  problem  in  Algorithm  2.  Similar  to  the  forward  problem 
discretization,  we  use  Rivara’s  algorithm  [25]  for  the  rehnement  of  the  elements. 

In  practice,  Binv  in  (5.65)  can  not  be  computed  since  a^,  gj,  g*,  and  are 
unknown.  Similar  to  the  approach  described  in  Section  5.1,  we  can  compute  an  estimate 
for  Binv  by  using  the  uniform  boundedness  of  the  operator  (See  Appendix  C)  and 
by  using  approximate  values  for  the  functions  involved  in  Binv  In  this  context,  we  use 
either  a  priori  information  or  the  recent  forward  and  inverse  problem  solution  updates 
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Algorithm  2  The  pseudocode  for  the  mesh  generation  algorithm  at  every  linearization 
step  of  the  inverse  problem. 

o  Generate  an  initial  uniform  mesh  (AjAa),  A  = 
o  Set 

o  Initialize  the  set  of  marked  elements:  Mg  ^ 
o  flag  =  True 

while  flag  =  True 

for  each  element  A^  €  A  with  mesh  parameter  hm 
if  first  linearization 

.  Use  current  solution  updates  Gj  and  G*  and  a  priori  information 
about  a  to  compute  in  (5.65) 

else 

.  Use  current  solution  updates  Gj  and  G*  and 
to  compute  in  (5.65) 

end 

if  hjYi  > 

end 

end 

if  Me  ^  {} 

.  Rehne  the  marked  elements  and  update  the  mesh  A 

■  Me  ^ 

else 

.  flag  =  False 

end 

end 

o  Solve  for  d)). 


to  calculate  (5.65)  on  each  element.  Then,  the  elements  with  the  mesh  parameter 
hm  >  Binv  are  determined  and  rehned  to  satisfy  (5.65). 

In  order  to  save  computations,  after  the  first  sweep  of  rehnement,  one  can  compute 
the  bound  B^v  only  for  the  new  elements.  Furthermore,  similar  to  the  approach 
described  in  Section  5.1,  the  term  \\g*{x)gj{x)\\Qm  in  (5.65)  can  be  stored  after  the  hrst 
mesh  generation  and  can  be  used  in  the  following  mesh  generations.  In  this  context, 
the  bound  B^v  can  be  updated  by  using  only  the  updated  ||Q;^||oo,m  value. 

Note  that,  in  practice,  one  of  the  two  terms  in  the  denominator  of  B^v  will  be 
dominant  depending  on  the  value  of  A.  Thus,  we  consider  only  the  dominant  term  for 
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the  computation  of  -Binu-  In  case  e/  can  not  be  chosen  in  prior,  we  consider  a  posterior 
approach,  set  Cinv  =  1  and  compute  \\a^\\i,rahm  or  \\g* {x)gj{x)\\o^rn\\a^\\i,mhm  on 

each  element,  which  are  used  as  rehnement  indicators.  Then,  the  elements  with  indicator 
value  which  exceeds  the  average  indicator  value  are  rehned.  In  this  case,  the  algorithm 
has  to  be  stopped  when  the  number  of  nodes  in  the  mesh  exceeds  the  allowable  number 
of  nodes. 

6.  Numerical  Experiments 

We  conduct  a  series  of  numerical  experiments  to  demonstrate  the  implications  of 
Theorems  1  and  2,  and  to  present  the  effectiveness  of  the  proposed  adaptive  mesh 
generation  algorithms.  We  perform  our  experiments  in  2D  for  ease  of  comparison  and 
image  quality  appraisals. 

In  the  hrst  simulation,  we  consider  a  series  of  image  reconstructions  to  show  the 
effectiveness  of  the  proposed  adaptive  mesh  generation  algorithms.  In  this  context,  we 
compare  the  images  reconstructed  by  using  uniform  meshes  for  the  forward  and  inverse 
problems  to  the  images  reconstructed  by  using  adaptive  meshes  which  are  designed 
based  on  Theorems  1  and  2. 

In  the  second  simulation,  we  show  the  effect  of  the  heterogeneity  size  on  the  accuracy 
of  the  reconstructed  absorption  images.  Next,  we  demonstrate  how  this  error  can  be 
addressed  by  the  proposed  adaptive  discretization  schemes. 

In  the  hnal  simulation  study,  we  demonstrate  the  implication  of  Theorem  2  and 
show  that  meshes  generated  for  the  forward  problem  by  using  discretization  error 
estimates  which  disregard  the  interaction  between  the  solutions  gj  and  g*  and  can 
lead  to  unstable  image  reconstructions.  We  note  that  the  proposed  adaptive  mesh 
generation  algorithm  for  the  forward  problem  addresses  this  problem. 

Note  that  in  all  experiments,  we  use  triangular  hnite  elements  with  piecewise  linear 
Lagrange  basis  functions  and  we  apply  Gaussian  elimination  method  to  solve  the  forward 
problem  (3.22)-(3.23)  and  the  linearized  inverse  problem  (3.26). 

6.1.  Simulation  Study  1 

In  this  simulation  study,  we  consider  the  geometry  shown  in  Figure  1(a).  We  simulate 
the  optical  data  by  solving  the  diffusion  equation  at  cu  =  0  on  a  hne  uniform  grid 
with  61  nodes  along  x  and  y  directions,  where  the  refractive  index  mismatch  parameter 
a  =  3.  11  sources  and  11  detectors  are  evenly  spaced  on  the  bottom  and  top  edges  of  the 
square,  respectively.  The  diffusion  coefficient  D{x)  =  0.0410  is  assumed  to  be  constant. 
The  circular  heterogeneity  with  absorption  coefficient  fia  =  0.2  cm“^  is  embedded  in  an 
optically  homogeneous  background  with  pa  =  0.04  cm“^. 
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(a)  The  optical  domain  and 
source-detector  configuration 
for  simulation  study  1. 


(b)  The  optical  domain  and 
source-detector  configuration 
for  simulation  study  3.  ri  = 
0.50  cm,  r2  =  0.75  cm,  = 
1.0  cm,  and  r4  =  1.25  cm. 


(c)  The  optical  domain  and 
source-detector  configuration 
for  simulation  study  2.  The 
radius  of  the  circles  is  0.75  cm. 


Figure  1.  The  setups  used  for  the  simulation  studies  1,  2,  and  3.  The  squares  and 
triangles  denote  the  detectors  and  sources,  respectively. 


In  order  to  obtain  a  series  of  of  absorption  imaging  problems  using  the  same  setup, 
we  consider  5  values  for  the  background  absorption  value.  Then,  for  each  imaging 
problem,  we  consider  three  mesh  scenarios:  Uniform  mesh  for  both  forward  and  inverse 
problems;  adaptive  mesh  for  the  forward  problem  and  uniform  mesh  for  the  inverse 
problem;  and  adaptive  meshes  for  both  forward  and  inverse  problems.  We  refer  to 
Table  2  for  a  brief  outline  of  the  first  simulation  study. 

Table  2.  The  mesh  scenarios  and  the  background  Pa  values  in  simulation  study  1. 


Mesh  (Forward) 

Mesh  (Inverse) 

Background  /Xq  (cm  ^) 

Uniform 

Adaptive 

Adaptive 

Uniform 

Uniform 

Adaptive 

0.032,  0.036,  0.040,  0.044,  0.050 
0.032,  0.036,  0.040,  0.044,  0.050 
0.032,  0.036,  0.040,  0.044,  0.050 

The  uniform  mesh  used  for  the  forward  problem  discretization  has  625  nodes  and 
is  shown  in  Figure  2(a).  The  uniform  mesh  for  the  inverse  problem  has  313  nodes  and 
is  shown  in  Figure  2(b).  We  use  the  algorithms  described  in  Sections  5.1  and  5.2  to 
generate  the  adaptive  meshes  for  the  forward  and  inverse  problems.  The  number  of 
nodes  in  each  of  the  adaptive  meshes  used  for  the  forward  problem  does  not  exceed  750. 
An  example  for  the  adaptive  mesh  generated  for  a  source  located  at  (1.0,0)  is  shown 
in  Figure  2(c).  For  an  example,  the  adaptive  mesh  for  the  inverse  problem  generated 
for  the  case  where  the  background  fia  =  0.050  cm“^  has  418  nodes  and  is  shown  in 
Figure  2(d). 
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(a)  The  uniform  mesh  with  625  nodes. 


(b)  The  uniform  mesh  with  313  nodes. 


(c)  The  adaptive  mesh  generated  for  the  (d)  The  adaptive  mesh  generated  for  the 
forward  problem  for  the  source  located  at  inverse  problem  solution,  with  418  nodes. 
(1.0,0):  Background  Ua  =  0.050  cm“^.  Background  pa  =  0.040  cm“^. 


Figure  2.  Examples  of  meshes  used  in  the  first  simulation  study. 


For  the  inverse  problem,  we  set  the  regularization  parameter  A  to  10“^  in  all 
experiments.  We  consider  the  image  reconstructed  by  using  hne  uniform  meshes  (61  x  61 
nodes  for  the  forward  problem  and  61  x  61  nodes  for  the  inverse  problem)  as  the  reference 
image,  which  is  assumed  to  possess  no  error  due  to  discretization.  We  compute  the  error 
II  —  0^11^1(0)  for  each  image  reconstruction  and  tabulate  the  results  in  Table  3.  We 
see  that  the  error  in  the  images  reconstructed  by  using  uniform  meshes  for  both  forward 
and  inverse  problems  is  signihcantly  reduced  by  the  use  of  adaptively  rehned  meshes. 
A  similar  behavior  is  observed  for  all  choices  of  background  absorption  value. 

We  present  image  reconstructions  in  Figures  3  and  4  for  the  two  extreme  cases, 
where  the  background  absorption  value  is  equal  to  0.032  and  0.050  cm“\  respectively. 
Figures  3(a)  and  4(a)  display  the  reference  images  used  to  compute  the  error  values 
given  in  Table  3.  Figures  3(c)  and  3(d)  show  that  the  optical  heterogeneity  is  resolved 
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Table  3.  The  error  ||a^  —  <5^11^1(0)  for  each  experiment  described  in  the  simulation 
study  1  and  Table  2.  The  first  column  shows  the  type  of  the  meshes  used  in  the  forward 
and  inverse  problems,  respectively.  The  unit  of  background  y-a  is  cm“^. 


Background  pa'- 

0.032 

0.036 

0.040 

0.044 

0.050 

U  niform-Uniform 

ll«^  • 

0.2325 

0.2559 

0.2773 

0.2932 

0.3013 

Adaptive-Uniform 

Adaptive-Adaptive 

1  “  “nl  Ll(0)  : 

0.1238 

0.1043 

0.1139 

0.0997 

0.1166 

0.0998 

0.1209 

0.1003 

0.1278 

0.1009 

better  by  using  adaptive  meshes  as  compared  to  the  reconstructed  image  obtained  by 
using  uniform  meshes,  which  is  shown  in  Figure  3(b).  These  results  are  consistent  with 
the  error  values  given  in  Table  3.  A  similar  trend  is  seen  in  Figures  4(c)  and  4(d).  Note 
that  the  number  of  nodes  in  the  adaptive  meshes  is  almost  equal  to  the  number  of  nodes 
that  the  uniform  meshes  have.  In  Figure  5,  we  show  the  cross-sectional  views  from  the 
reconstructed  images.  We  see  that  the  use  of  coarse  uniform  meshes  fails  to  resolve  the 
circular  heterogeneity  especially  for  the  case  in  which  the  background  p,a  =  0.032  cm“^. 

6.2.  Simulation  Study  2 

In  this  study,  we  consider  the  geometry  shown  in  Figure  1(b).  To  simulate  the  optical 
data,  we  use  the  same  source-detector  conhguration  considered  in  the  hrst  simulation 
study.  We  simulate  the  optical  data  by  solving  the  diffusion  equation  at  cu  =  0  on  a 
hne  uniform  grid  with  61  nodes  along  x  and  y  directions,  where  the  refractive  index 
mismatch  parameter  a  =  3.  The  diffusion  coefficient  D{x)  is  assumed  to  be  constant 
and  D{x)  =  0.0410  cm. 

We  consider  4  different  radii  for  the  circular  heterogeneity  with  pa  =  0.20  cm“^ 
embedded  in  a  background  with  Ha  =  0.040  cm“^  as  shown  in  Figure  1(b).  For  each 
case,  we  compute  the  error  for  different  mesh  scenarios,  similar  to  the  hrst  simulation 
study:  Uniform  mesh  for  both  forward  and  inverse  problems;  adaptive  mesh  for  the 
forward  problem  and  uniform  mesh  for  the  inverse  problem;  and  adaptive  meshes  for 
both  forward  and  inverse  problems.  The  adaptive  meshes  for  this  simulation  study  were 
generated  based  on  Theorems  1  and  2,  and  the  mesh  generation  algorithms  described 
in  the  hrst  simulation  study  and  Section  5.  The  uniform  meshes  used  for  the  forward 
and  inverse  problems  are  identical  to  those  used  in  the  hrst  simulation  study.  We  note 
that  the  number  of  nodes  in  the  adaptive  meshes  generated  for  the  forward  and  inverse 
problems  is  close  to  the  number  of  nodes  in  the  corresponding  uniform  meshes. 

In  Table  4,  we  tabulate  the  error  norm  ||q;^  —  a^\\  obtained  for  each  heterogeneity 
size  with  diherent  mesh  choices.  Table  4  shows  that  the  error  increases  with  increasing 
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(a)  The  optical  absorption  image  used  as  the 
reference  for  error  computations. 


(c)  The  reconstructed  absorption  image  using  an 
adaptive  mesh  for  the  forward,  and  the  uniform 
mesh  in  Figure  2(b)  for  the  inverse  problem. 


(b)  The  reconstructed  absorption  image  using  the 
uniform  mesh  in  Figure  2(a)  for  the  forward,  and 
the  uniform  mesh  in  Figure  2(b)  for  the  inverse 
problem. 


(d)  The  reconstructed  absorption  image  using  an 
adaptive  mesh  for  the  forward,  and  the  adaptive 
mesh  in  Figure  2(d)  for  the  inverse  problem. 


Figure  3.  The  reconstruction  results  of  simulation  study  1,  with  the  background 
Ua  =  0.032  cm“^. 


heterogeneity  size.  We  see  that  the  reduction  in  the  error  as  a  result  of  using  adaptive 
meshes  is  more  significant  for  smaller  sized  heterogeneities.  However,  further  reduction 
in  the  error  norm  ||q!^  —  is  possible  by  increasing  the  number  of  nodes  in  the  meshes. 

For  brevity,  we  only  show  the  reconstruction  results  for  the  extreme  cases:  r  =  0.5 
cm  and  r  =  1.25  cm.  We  note  that  the  regularization  parameter  A  =  5  x  10“®  in  all 
reconstructions.  Figures  6(a)  and  6(b)  show  the  images  used  as  the  reference  images 
in  the  calculation  of  the  error  norms  ||q:^  —  cefW  listed  in  Table  4.  Figures  6(e)-6(f)  show 
that  the  adaptive  meshes  reduce  the  artifacts  as  compared  to  the  images  reconstructed 
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I 


I 

(a)  The  optical  absorption  image  used  as  the 
reference  for  error  computations. 

10.2 

0.15 


I 

(c)  The  reconstructed  absorption  image  using  an 
adaptive  mesh  for  the  forward,  and  the  uniform 
mesh  in  Figure  2(b)  for  the  inverse  problem. 


0  0 


(b)  The  reconstructed  absorption  image  using  the 
uniform  mesh  in  Figure  2(a)  for  the  forward,  and 
the  uniform  mesh  in  Figure  2(b)  for  the  inverse 
problem. 


0  0 


(d)  The  reconstructed  absorption  image  using  an 
adaptive  mesh  for  the  forward,  and  the  adaptive 
mesh  in  Figure  2(d)  for  the  inverse  problem. 


Figure  4.  The  results  of  simulation  study  1,  with  the  background  Ua  =  0.050  cm 


by  using  uniform  meshes,  which  are  shown  in  Figures  6(c)-6(d). 

6.3.  Simulation  Study  3 

In  this  simulation  study,  we  consider  the  geometry  shown  in  Figure  1(c).  The  center  of 
the  circular  heterogeneity  is  moved  vertically  towards  the  detector  side  to  see  the  effect 
on  the  imaging  accuracy.  Next,  we  show  how  the  error  in  the  reconstructed  images 
due  to  discretization  can  be  addressed  by  using  appropriate  meshes  for  the  solutions  of 
the  forward  and  inverse  problems.  In  this  context,  we  compare  the  results  obtained  by 
using  1)  uniform  meshes,  2)  the  adaptive  meshes  generated  using  conventional  a  priori 
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(a)  The  cross-sectional  cuts  taken  from  Fig¬ 
ures  3(a),  3(b),  and  3(d),  along  x  direction  at 
y  =  3.  The  solid,  square,  and  diamond  lines 
correspond  to  the  cross-sectional  cuts  taken 
from  the  images  shown  in  Figures  3(a),  3(d), 
and  3(b),  respectively. 


(b)  The  cross-sectional  cuts  taken  from  Fig¬ 
ures  4(a),  4(b),  and  4(d),  along  x  direction  at 
y  =  3.  The  solid,  square,  and  diamond  lines  cor¬ 
respond  to  the  cross-sectional  cuts  taken  from 
the  images  shown  in  Figures  4(a),  4(d),  and  4(b), 
respectively. 


Figure  5.  The  cross-sectional  views  from  the  reconstructed  images  in  simulation  study 
1,  corresponding  to  the  cases  where  the  background  pa  =  0.032  and  pa  =  0.050  cm“^, 
respectively. 


Table  4.  The  norm  of  and  the  error  ||a'''  —  d()||Li(n)  for  each  experiment 
described  in  the  simulation  study  2.  The  first  column  shows  the  type  of  the  meshes 
used  in  the  forward  and  inverse  problems,  respectively.  The  radius  of  the  circular 
heterogeneity  is  given  in  cm. 


Radius: 

0.50 

0.75 

1.0 

1.25 

0.7196 

1.3760 

1.4759 

1.7817 

U  niform-Uniform 

1 

- 

0.5622 

0.5706 

0.5850 

0.6337 

Adaptive-Uniform 

1 

-  dnlUh*^)- 

0.2153 

0.2776 

0.3766 

0.5113 

Adaptive-Adaptive 

1 

0.2020 

0.2630 

0.3592 

0.5034 

discretization  error  estimates,  and  3)  the  adaptive  meshes  proposed  in  this  study.  By 
conventional  error  estimates,  we  mean  the  a  priori  discretization  error  estimates  (3.24) 
and  (3.25)  for  the  solution  of  the  forward  problem,  and  the  a  priori  interpolation  error 
estimate  (4.40)  for  the  solution  of  the  inverse  problem. 

To  simulate  the  optical  data,  we  use  the  same  source-detector  configuration 
considered  in  the  first  simulation  study.  We  simulate  the  optical  data  by  solving 
the  diffusion  equation  at  a;  =  0  on  a  fine  uniform  grid  with  61  nodes  along  x  and  y 
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directions,  where  the  refractive  index  mismatch  parameter  a  =  3.  We  note  that,  in 
all  reconstrnctions,  the  backgronnd  absorption  valne  is  set  to  /ia  =  0.04  cm“^  and  the 
diffnsion  coefficient  D{x)  is  assnmed  to  be  constant  and  D{x)  =  0.0410  cm. 

The  nniform  meshes  used  in  this  simulation  study  are  identical  to  the  ones  used  in 
simulation  studies  1  and  2.  Sample  meshes  for  the  forward  problem  solution  using  the 
conventional  and  the  proposed  adaptive  meshing  strategies  are  shown  in  Figures  7(a)- 
7(b)  and  Figures  8(c)  and  8(d),  respectively.  We  see  that  the  conventional  adaptive 
mesh  generation  strategy  leads  to  meshes  rehned  around  only  sources  or  detectors,  but 
not  both.  In  contrast.  Figures  8(c)  and  8(d)  show  that  the  proposed  strategy  results  in 
adaptive  meshes  rehned  around  sources,  detectors,  and  the  heterogeneity  as  well.  This 
observation  is  consistent  with  Theorem  2.  The  adaptive  mesh  for  the  inverse  problem 
solution,  which  was  generated  using  the  a  priori  interpolation  error  estimate  (4.40)  is 
shown  in  Figure  8(f).  Note  that  the  mesh  was  generated  for  the  case  where  the  circular 
heterogeneity  was  centered  at  (3.0, 3.5).  The  mesh  generated  based  on  Theorem  1 
(Figure  8(e))  provides  higher  resolution  close  to  the  sources  and  detectors  as  compared 
to  the  mesh  shown  in  Figure  8(f),  which  is  merely  rehned  around  the  heterogeneity. 

In  this  simulation  study,  we  consider  4  different  positions  for  the  center  of  the 
circular  heterogeneity  with  radius  0.75  cm,  along  y— axis:  center  at  (3. 0,3.0),  (3. 0,3. 5), 
(3. 0,4.0),  and  (3. 0,4. 5),  respectively.  Similar  to  the  previous  simulations,  we  compute 
the  error  in  the  reconstructed  images  for  all  cases,  and  compare  the  error  values  attained 
by  different  meshing  strategies.  Finally  we  present  the  reconstructed  images  obtained 
by  using  different  mesh  strategies  corresponding  to  the  case  where  the  circular  inclusion 
is  centered  at  (3.0,  3.5)  and  (3.0, 4.0). 

Using  the  meshes  for  the  forward  problem  discretization  (see  Figures  7(a)- 
7(b))  which  were  generated  by  using  the  conventional  a  priori  discretization  error 
estimates  (3.24)-(3.25)  leads  to  the  image  reconstructions  shown  in  Figures  7(c) 
and  7(d),  where  the  regularization  parameter  A  =  10“®.  We  observe  that  the  finite¬ 
dimensional  operator  does  not  provide  a  stable  solution.  We  note  that  using  an  adaptive 
mesh  for  the  inverse  problem  solution  does  not  change  the  outcome  (Figure  7(d)).  Note 
also  that  the  meshes  generated  by  using  the  conventional  a  priori  discretization  error 
estimates  (3.24)-(3.25)  are  sufficient  to  provide  accurate  hnite  element  approximations 
to  the  actual  solutions  pj  and  g*.  Therefore,  the  unstable  reconstructions  can  be 
attributed  to  the  errors  K  —  K  and  7  —  7,  due  to  inappropriate  discretization  as  noted 
by  Theorem  2.  In  consistence  with  Theorem  2,  this  observation  suggests  that  solving 
the  forward  problem  accurately  does  not  necessarily  imply  that  approximate  operator 
K  and  7  are  error-free.  Therefore,  in  order  to  address  such  problems,  one  has  to  follow 
a  discretization  scheme  based  on  Theorem  2  for  the  solution  of  the  forward  problem, 
which  takes  into  account  the  interaction  between  the  solutions  of  the  diffusion  equation 
and  the  associated  adjoint  problem,  as  described  in  Section  5.1. 
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In  order  to  suppress  the  severe  artifacts  observed  in  Figures  7(c)  and  7(d),  we 
increased  the  regularization  parameter  and  set  A  =  10“®.  The  resulting  images  are  shown 
in  Figures  7(e)  and  7(f).  As  noted  by  Theorems  1  and  2,  increasing  the  regularization 
parameter  reduces  the  error  in  the  reconstructed  images.  However,  increasing  the 
regularization  parameter  will  also  compromise  the  image  quality  and  lead  to  over¬ 
smoothed  images.  In  order  to  address  the  instability  issue  without  degrading  the 
image  quality  by  using  high  regularization  parameters,  we  modified  the  adaptive  mesh 
generation  method  that  lead  to  the  meshes  shown  in  Figures  7(a)-7(b).  In  this  context, 
for  the  first  2  rehnements,  we  used  the  proposed  mesh  generation  algorithm  based  on 
Theorem  2  to  generate  an  initial  adaptive  mesh;  and  for  the  next  2  rehnements,  we 
used  the  conventional  error  estimates  (3.24)-(3.25).  Following  this  modihcation,  the 
samples  of  the  resulting  adaptive  meshes  are  shown  in  Figures  8(a)  and  8(b).  For  a 
comparison,  we  also  present  in  Figures  8(c)-8(d),  the  adaptive  meshes  generated  by 
using  the  proposed  adaptive  mesh  generation  algorithms  as  described  in  Section  5.1. 
We  observe  that  the  meshes  shown  in  Figures  8(c)-8(d),  indicate  further  rehnement 
around  sources,  detectors  and  the  circular  heterogeneity  as  compared  with  the  adaptive 
meshes  shown  in  Figures  8(a)  and  8(b). 

Examples  of  the  adaptive  meshes  generated  for  the  inverse  problem  based  on 
Theorem  1  and  the  conventional  a  priori  interpolation  estimate  (4.40)  are  shown  in 
Figures  8(e)  and  8(f),  respectively.  We  observe  that  the  adaptive  mesh  shown  in 
Figure  8(e)  provides  higher  resolution  around  sources  and  detectors  as  compared  to 
the  adaptive  mesh  shown  8(f). 

We  note  that  the  uniform  meshes  used  in  this  simulation  study  are  identical  to 
those  used  in  the  previous  simulation  studies. 

In  order  to  compare  the  performance  of  the  conventional  and  proposed  adaptive 
mesh  strategies,  we  perform  4  experiments  and  compute  the  error  in  the  reconstructed 
optical  absorption  images.  For  each  experiment  we  consider  5  different  mesh  strategies 
and  refer  to  Table  5  for  the  description  of  these  experiments. 

We  show  the  reconstructed  optical  absorption  images  for  the  two  cases  in  Figures  9 
and  10,  corresponding  to  the  circular  heterogeneity  centered  at  (3. 0,3. 5)  and  (3. 0,4.0), 
respectively.  Figures  9(a)  and  10(a)  show  the  reference  absorption  image  reconstructions 
which  are  used  to  compute  the  error  in  the  reconstructed  optical  images. 

Figure  9(b)  shows  the  image  reconstructed  using  coarse  uniform  meshes  for  both 
the  forward  and  inverse  problems,  for  the  case  where  the  circular  inclusion  is  centered 
at  (3. 0,3. 5)  where  the  regularization  parameter  was  set  to  A  =  10“®.  With  the  same 
value  of  the  regularization  parameter.  Figures  9(c)  and  9(e)  show  the  images  obtained 
by  using  the  adaptive  meshes  based  on  Theorems  2  and  1,  respectively.  We  observe 
the  improvements  especially  around  the  boundaries.  Using  the  conventional  adaptive 
meshes  for  the  forward  problem  solution,  which  were  modihed  around  sources  and 
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Table  5.  The  relevant  parameters  in  the  experiments  1-5  in  simulation  study  3. 
The  acronym  “Conv.”  implies  that  the  corresponding  mesh  was  generated  using 
the  conventional  a  priori  discretization  error  estimates  (3.24)-(3.25)  for  the  forward 
problem  solution,  and  the  conventional  a  priori  interpolation  error  estimate  (4.40) 
for  the  inverse  problem  solution.  The  acronym  “Prop.”  refers  to  the  adaptive 
meshes  generated  by  using  the  proposed  adaptive  mesh  generation  algorithms  based  on 
Theorems  1  and  2,  for  the  inverse  and  forward  problem  solutions,  respectively.  The  last 
column  in  the  table  shows  the  coordinates  of  the  center  of  the  circular  heterogeneity, 
considered  in  each  experiment. 


Mesh  (Forward) 

Mesh  (Inverse) 

Center  at: 

Exp. 

1 

Uniform 

Uniform 

[(3.0,3.0),  (3.0,3.5),  (3.0,4.0),  (3.0,4.5)] 

Exp. 

2 

Adaptive  (Conv.) 

Uniform 

[(3.0,3.0),  (3.0,3.5),  (3.0,4.0),  (3.0,4.5)] 

Exp. 

3 

Adaptive  (Conv.) 

Adaptive  (Conv.) 

[(3.0,3.0),  (3.0,3.5),  (3.0,4.0),  (3.0,4.5)] 

Exp. 

4 

Adaptive  (Prop.) 

Uniform 

[(3.0,3.0),  (3.0,3.5),  (3.0,4.0),  (3.0,4.5)] 

Exp. 

5 

Adaptive  (Prop.) 

Adaptive  (Prop.) 

[(3.0,3.0),  (3.0,3.5),  (3.0,4.0),  (3.0,4.5)] 

detectors  as  noted  before,  we  ran  into  a  similar  instability  problem.  Therefore,  in  order 
to  obtain  better  reconstructions  with  the  conventional  adaptive  meshes,  we  set  the 
regularization  parameter  A  =  10“^  in  the  corresponding  inverse  problem  formulations. 
The  resulting  reconstructed  images  are  shown  in  Figures  9(d)  and  9(f).  In  this  case, 
we  observe  that  the  use  of  conventional  adaptive  meshes  for  the  forward  and  inverse 
problems  does  not  improve  the  image  quality  as  compared  to  the  reconstructed  image 
shown  in  Figure  9(b),  which  is  obtained  by  using  coarse  uniform  meshes. 

We  observe  similar  results  for  the  case  where  the  circular  inclusion  is  centered 
at  (3. 0,4.0).  We  note  that  the  regularization  parameter  is  set  to  A  =  10“®  for  all 
reconstructions  except  for  the  reconstructions  obtained  by  using  conventional  adaptive 
meshes,  in  which  case  A  =  10“^.  Figures  9  show  the  reconstructed  images  corresponding 
to  all  meshing  strategies. 

Table  6  shows  the  error  norm  computations  for  all  cases.  The  error  values  are 
consistent  with  Figures  9  and  10.  In  all  cases,  the  proposed  adaptive  meshes  signihcantly 
reduce  the  error  in  the  reconstructed  images.  Furthermore,  the  image  quality  is  enhanced 
by  merely  appropriate  discretization,  without  having  to  increase  the  regularization 
parameter.  In  contrast,  the  conventional  adaptive  meshes  perform  worse  than  uniform 
meshes  even  though  a  higher  regularization  parameter  is  used. 

7.  Conclusion 

In  this  work,  we  presented  an  error  analysis  to  show  the  relationship  between  the  error 
in  the  reconstructed  optical  absorption  images  and  the  discretization  of  the  forward 
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Table  6.  The  error  ||q;'''  —  11^1(0)  for  each  experiment  described  in  the  simulation 

study  3.  The  first  column  shows  the  type  of  the  meshes  used  in  the  forward  and  inverse 
problems,  respectively.  The  superscript  ‘C’  denotes  that  the  corresponding  adaptive 
mesh  generation  is  based  on  conventional  a  priori  error  estimates. 


Radius  at: 

(3.0,3.0) 

(3.0,3.5) 

(3.0,4.0) 

(3.0,4.5) 

U  niform-Uniform 

1  ~  <5^1  Li(o): 

0.4539 

0.4606 

0.4733 

0.4956 

Adaptive-Uniform 

Adaptive-Adaptive 

“  «nl  Li(o): 

l|a^  -  ^uhnny- 

0.2690 

0.2433 

0.2695 

0.2455 

0.2634 

0.2459 

0.2507 

0.2434 

Adaptive-Uniform 

Adaptive-Adaptive 

\\(^^  -  ^n^hyny 

0.7989 

0.8011 

0.7596 

0.7614 

0.7072 

0.7070 

0.6418 

0.6351 

and  inverse  problems.  We  summarized  the  implications  of  the  error  analysis  in  two 
theorems  and  proposed  novel  adaptive  mesh  generation  algorithms  for  the  forward  and 
inverse  problems.  The  theorems  provide  an  insight  into  the  impact  of  forward  and 
inverse  problem  discretizations  on  the  accuracy  of  the  reconstructed  optical  absorption 
images.  These  theorems  show  that  the  error  in  the  reconstructed  optical  image  due 
to  the  discretization  of  each  problem  is  bounded  by  roughly  the  multiplication  of  the 
discretization  error  in  the  corresponding  solution  and  the  solution  of  the  other  problem. 
Theorem  2  shows  that  solving  the  diffusion  equation  and  the  associated  adjoint  problem 
accurately  may  not  ensure  small  values  for  — i^||  and  II7— 7||,  which  may  lead  to  large 
errors  in  the  reconstructed  optical  images,  depending  on  the  value  of  the  regularization 
parameter  as  well. 

Based  on  the  error  analysis,  we  provided  two  practical  adaptive  mesh  generation 
algorithms,  one  for  the  forward  and  one  for  the  inverse  problem,  which  take  the 
interdependence  between  the  two  problems  into  account.  Our  simulation  experiments 
showed  that  the  proposed  mesh  generation  algorithms  signihcantly  improve  the  accuracy 
of  the  reconstructed  optical  images  while  keeping  the  number  of  unknowns  in  the  forward 
and  inverse  problems  within  the  allowable  limits.  Furthermore,  in  our  simulation  studies, 
we  showed  that  the  adaptive  mesh  generation  algorithms  based  on  conventional  error 
estimates  may  lead  to  high  error  in  the  reconstructed  images. 

The  error  analysis  presented  in  this  work  can  be  extended  to  show  the  effect  of  the 
discretization  error  on  the  accuracy  of  simultaneous  reconstruction  of  scattering  and 
absorption  coefficients.  Our  future  work  will  focus  on  the  error  in  the  simultaneous 
reconstruction  of  scattering  and  absorption,  resulting  from  the  discretization  of  the 
forward  and  inverse  problem.  Note  that  the  error  analysis  introduced  in  this  paper  is 
not  limited  to  DOT,  and  can  easily  be  adapted  for  similar  inverse  parameter  estimation 
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Appendix  A.  Bonndedness  of  Aa 

Nd,Ns 

\\Aaa\\ii  =  I  J^H{xixf,x)a{x)dx  .  (A.l) 

id 

We  can  write  the  following  inequality: 

Na,Ns  „ 

\\Aaa\\ii  <  Y  /  \H{xl,xfx)a(x)\dx 

<  (  !n\H{xfx\-x)\dx  )  ||a(i)||oo.  (A.2) 

Using  Schwarz’  inequality,  we  can  write  an  upper  bound  for  the  summation  as  follows: 

Nd^Ns  Nd,Ns 

Y  /  \Hixi,xf,x)\dx  =  Y  \\9*ix;x^^)g{x;xl)\\Li(n) 

i,j  ^  i,j 

Nd,Ns 

id 

<  ArfA^max||5(*||omax||5(j||o,  (A.3) 

*  j 

which  leads  to 

IIAaolIzi  <  NdNsraax\\g*\\oraax\\gj\\o\\a\\oo- 

*  3 

Therefore  an  upper  bound  for  the  norm  of  Aa  is  given  by 

\\Aa\\L^in)^ii  <  AdA^smaxllsiillomaxllsijIlo.  (A.4) 

^  j 

The  boundedness  of  gj  and  g*  imply  that  Aa  is  bounded.  □ 

Note  that  for  actual  Dirac-delta  source  and  Dirac-delta  adjoint  source,  gj  and  g* 
are  also  bounded  such  that  gj^g*  G  L^(f2)  [22], 

Appendix  B.  Compactness  of  Aa 

Aa  is  bounded  by  (A.4).  Furthermore  Aa  maps  the  inhnite  dimensional  subspace 

L°^  to  a  hnite-dimensional  subspace  that  is  the  range  R{Aa)  of  Aa  satishes 

R{Aa)  G  hnite  number  of  sources  and  detectors.  As  a  result,  Aa  is 

compact  [21].  The  inverse  problem  is  ill-posed  as  a  consequence  of  compactness  [21]. 

□ 
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The  identity  operator  J  is  a  bounded  operator  with  bounded  inverse  and  {Pnl)  ^  =  I  '■ 
Yn  ^  Yn.  Furthermore  ||-Pn||  is  bounded  [5] 

n 

||P„||  =  max^  |Lfc(a;)|,  (C.l) 

^  k=l 

where  Lk{x)  are  the  piecewise  linear  Lagrange  basis  functions.  A  =  is  bounded 

and  compact,  and  K  =  XI  +  A  is  injective.  As  a  result,  by  Theorem  13.7  in  [21],  the 
projection  method  also  converges  for  K  =  XI  +  A.  Convergence  of  projection  for  K 
implies  {PnK)-^PnKa^  a\  n  ^  oo  for  (P„A)-ip„A  :  X  ^  [21]. 

It  follows  from  the  proof  of  Theorem  13.7  in  [21]  that  [/  +  ^P„A]“^  :  Yn  X^  exists 
and  is  uniformly  bounded  for  all  sufficiently  large  n.  Then  from  PnK  =  APn[/+^PnA]  = 
A[/  +  jPnA],  it  follows  that  PnK  :  is  invertible  for  all  sufficiently  large  n  with 

the  inverse  given  by 

{PnK)-^  =  [I+^PnA]-^^.  (C.2) 

As  a  result  we  can  write  {PnK)~^PnK  as  follows: 

{PnK)-^PnK  =[I+  jPnA]-^jPnK.  (C.3) 

Thus, 

(C.4) 

using  the  fact  that  (/  +  ^P„A)“^  is  uniformly  bounded,  that  is  ||(/  +  |P„A)“^||y„^x„  < 
Cm  where  Cm  >  0  is  independent  of  n  and  ||P„||y^y^  =  1  for  piecewise  linear  Lagrange 
basis  functions.  □ 


Appendix  D.  Solution  of  the  Model  Problem 


In  order  to  initialize  the  adaptive  mesh  for  the  solution  of  the  forward  problem  (provided 
D{x)  =  D  and  Pai^)  =  JC  are  spatially  constant),  we  use  an  analytical  solution  to 
compute  the  estimates  of  pj  and  g*.  To  derive  an  analytical  2D  solution,  we  use  polar 
coordinates  to  rewrite  (2.1): 


pdp 


pd6 


4:nS{p-  Ps)S{9 
P  D 


es) 


1 


where  we  consider  an  unbounded  domain,  model  the  source  by  a  Dirac-delta  function 
and  =  —{paC  +  juj)/cD.  Then  the  solution  is  given  by  [18] 


4  f  ^ 

g{p,  ps]  0,  Os)  =  llo{knp<)Ko{knp>)  +  Em=i  cos[m(6'  -  9s)]Im{knp<)Km{knp>) 
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where  and  are  the  modihed  Bessel  functions  of  the  hrst  and  second  kind, 
respectively  [1]  and 

The  solution  of  the  problem  in  3D  can  be  derived  in  a  similar  manner  [18,  29]. 
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(a)  The  optical  absorption  image  used  as  the 
reference  for  error  computations.  The  image 
corresponds  to  the  reconstruction  of  the  circular 
heterogeneity  of  radius  0.5  cm. 


(c)  The  reconstructed  absorption  image  using 
the  uniform  mesh  in  Figure  2(a)  for  the  forward, 
and  the  uniform  mesh  in  Figure  2(b)  for  the 
inverse  problem. 

10.2 

0.15 


I 

(e)  The  reconstructed  absorption  image  using 
adaptive  meshes  for  both  the  forward  and  the 
inverse  problems. 


(b)  The  optical  absorption  image  used  as  the 
reference  for  error  computations.  The  image 
corresponds  to  the  reconstruction  of  the  circular 
heterogeneity  of  radius  1.25  cm. 


(d)  The  reconstructed  absorption  image  using 
the  uniform  mesh  in  Figure  2(a)  for  the  forward, 
and  the  uniform  mesh  in  Figure  2(b)  for  the 
inverse  problem. 


(f)  The  reconstructed  absorption  image  using 
an  adaptive  mesh  for  both  the  forward  and  the 
inverse  problems. 


Figure  6.  The  results  of  simulation  study  2.  The  left  and  right  columns  show  the 
reconstructed  images  regarding  the  optical  heterogeneity  with  radius  0.50  cm,  and  1.25 
cm,  respectively.  The  background  p,a  =  0.040  cm“^  in  all  of  the  reconstructions.  The 
reference  images  shown  in  (a)  and  (b)  are  obtained  using  a  uniform  mesh  with  61  x  61 
nodes  in  both  the  forward  and  inverse  problems. 
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(a)  The  adaptive  mesh  with  865  nodes  for 
the  forward  problem  solution  for  the  source 
located  at  (2.0,0),  generated  based  on  the 
conventional  a  priori  error  estimate  (3.24). 
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(b)  The  adaptive  mesh  with  865  nodes  for 
the  forward  problem  solution  for  the  detector 
located  at  (4. 0,6.0),  generated  based  on  the 
conventional  a  priori  error  estimate  (3.25). 


(c)  The  reconstructed  optical  image  using 
adaptive  mesh  for  the  forward  and  uniform 
mesh  for  the  inverse  problem.  A  =  10“®. 


(e)  The  reconstructed  optical  image  using 
adaptive  mesh  for  the  forward  and  uniform 
mesh  for  the  inverse  problem.  A  =  10“®. 


0  0 


(d)  The  reconstructed  optical  image  using 
adaptive  meshes  for  both  the  forward  and 
inverse  problems. A  =  10“®. 


(f)  The  reconstructed  optical  image  using 
adaptive  meshes  for  both  the  forward  and 
inverse  problems.  A  =  10“®. 


Figure  7.  (a)-(b)  Samples  of  adaptive  meshes  in  the  third  simulation  study,  generated 
by  using  the  conventional  error  estimates  (3.24)  and  (3.25),  which  led  to  unstable 
optical  image  reconstruction  shown  in  (c)  to  (f),  for  the  circular  heterogeneity  centered 
at  (3.0, 3.5).  (c)-(d)  The  unstable  optical  image  reconstructions  in  the  third  simulation 
study,  obtained  by  using  the  adaptive  meshes  for  the  forward  problem  solution  whose 
examples  are  shown  in  (a)-(b).  A  =  10“®.  (e)-(f)  The  unstable  optical  image 

reconstructions  in  the  third  simulation  study,  obtained  by  using  the  adaptive  meshes 
for  the  forward  problem  solution  whose  examples  are  shown  in  (a)-(b).  A  was  set  to 
10“®  to  suppress  the  significantly  large  artifacts  observed  in  (c)-(d). 
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(a)  The  adaptive  mesh  with  942  nodes  for 
the  forward  problem  solution  for  the  source 
located  at  (2.0,0),  obtained  by  refining  the 
adaptive  mesh  shown  in  Figure  7(a)  around 
the  detectors. 


(c)  The  adaptive  mesh  with  895  nodes 
for  the  forward  problem  solution  for  the 
source  located  at  (2.0,0),  generated  based 
on  Theorem  2. 


(e)  The  adaptive  mesh  with  691  nodes 
for  the  inverse  problem  solution,  generated 
based  on  Theorem  1. 


(b)  The  adaptive  mesh  with  955  nodes 
for  the  forward  problem  solution  for  the 
detector  located  at  (4.0, 6.0),  obtained 
by  refining  the  adaptive  mesh  shown  in 
Figure  7(b)  around  the  sources. 


(d)  The  adaptive  mesh  with  896  nodes 
for  the  forward  problem  solution  for  the 
detector  located  at  (4.0, 6.0),  generated 
based  on  Theorem  2. 


(f)  The  adaptive  mesh  with  609  nodes 
for  the  inverse  problem  solution,  generated 
based  on  conventional  error  estimates. 


Figure  8.  Samples  of  adaptive  meshes  used  in  the  third  simulation  study,  which  led 
to  the  optical  image  reconstructions  shown  in  Figure  10.  The  meshes  were  generated 
for  the  circular  heterogeneity  centered  at  (3. 0,4. 5). 
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(a)  The  absorption  image  used  as  the  reference 
in  the  error  computations. 


(c)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  Theorem  2  for  the 
forward,  and  the  uniform  mesh  in  Figure  2(b) 
for  the  inverse  problem. 
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(e)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  Theorem  2  for  the 
forward,  and  using  the  adaptive  mesh  based  on 
Theorem  1  for  the  inverse  problem. 


(b)  The  reconstructed  absorption  image  using 
the  uniform  mesh  in  Figure  2(a)  for  the  forward, 
and  the  uniform  mesh  in  Figure  2(b)  for  the 
inverse  problem. 


(d)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  a  priori  error 
estimates  (3.24)  and  (3.25)  for  the  forward,  and 
the  uniform  mesh  in  Figure  2(b)  for  the  inverse 
problem. 


(f)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  a  priori  error 
estimates  (3.24)  and  (3.25)  for  the  forward,  and 
the  interpolation  error  estimate  (4.40)  for  the 
inverse  problem. 


Figure  9.  The  reconstructed  optical  images  regarding  the  circular  heterogeneity 
centered  at  (3.0, 3.5). 
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(a)  The  absorption  image  used  as  the  reference 
in  the  error  computations. 


(c)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  Theorem  2  for  the 
forward,  and  the  uniform  mesh  in  Figure  2(b) 
for  the  inverse  problem. 
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(e)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  Theorem  2  for  the 
forward,  and  the  adaptive  mesh  based  on 
Theorem  1  for  the  inverse  problem. 
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(b)  The  reconstructed  absorption  image  using 
the  uniform  mesh  in  Figure  2(a)  for  the  forward, 
and  the  uniform  mesh  in  Figure  2(b)  for  the 
inverse  problem. 


(d)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  a  priori  error 
estimates  (3.24)  and  (3.25)  for  the  forward,  and 
the  uniform  mesh  in  Figure  2(b)  for  the  inverse 
problem. 


(f)  The  reconstructed  absorption  image  using 
adaptive  meshes  based  on  on  a  priori  error 
estimates  (3.24)  and  (3.25)  for  the  forward,  and 
the  interpolation  error  estimate  (4.40)  for  the 
inverse  problem. 


Figure  10.  The  reconstructed  optical  images  regarding  the  circular  heterogeneity 
centered  at  (3.0, 4.0). 


